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(57) ABSTRACT 


The invention generally features methods for providing engi- 
neered pluripotent stem cells that can be used to study bio- 
logical response and pathways, including differentiation and 
drug effects. For example, these cells are provided compris- 
ing two or more exogenous expression cassettes including a 
selectable or screenable marker under the control of different 
condition-responsive regulatory elements, such as differen- 
tiation-responsive promoters or regulatory element of a 
receptor, drug target, drug metabolizing enzyme or signaling 
pathway gene. Also provided are sets of stem cell lines each 
comprising a different exogenous expression cassette includ- 
ing a selectable or screenable marker under the control of a 
different condition-responsive regulatory element. 
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COMPENDIUM OF READY-BUILT STEM 
CELL MODELS FOR INTERROGATION OF 
BIOLOGICAL RESPONSE 


[0001] This application is a divisional of U.S. application 
Ser. No. 13/161,149, filed Jun. 15, 2011, which claims the 
benefit of U.S. Provisional Application No. 61/354,878, filed 
Jun. 15, 2010, the entire disclosure of each of which is incor- 
porated herein by reference. 


BACKGROUND OF THE INVENTION 


[0002] 1. Field of the Invention 

[0003] The present invention relates generally to the field of 
molecular biology, stem cells and differentiated cells. More 
particularly, it concerns engineered stem cell lines that can be 
used to study biological and/or pharmaceutical response. 
[0004] 2. Description of Related Art 

[0005] A key unmet need in biomedical research and phar- 
maceutical development is reliably available, cost-effective 
and predictive models for determining biological response 
under diverse conditions as well as metabolic and toxicologi- 
cal properties of drug compounds. Current in vitro models 
such as primary cell culture suffer from inconsistent avail- 
ability and significant phenotypic variability. Current meth- 
ods used to make cells into cell lines can render the responses 
ofthe cells non-authentic. In vivo animal models are prohibi- 
tively expensive, have low throughput, and are often not pre- 
dictive for humans. 

[0006] Therefore, there is a need for production of various 
cell types in an easy-to-assay format for therapeutic and 
research use. 


SUMMARY OF THE INVENTION 


[0007] The present invention overcomes a major deficiency 
inthe art in providing pluripotent stem cells expressing one or 
more selectable or screenable marker(s) under the control of 
one or more condition-responsive regulatory elements. In a 
first embodiment, there is provided a pluripotent stem cell 
line comprising a first and second exogenous expression cas- 
sette each comprising a selectable or screenable marker under 
the control of a condition-responsive regulatory element. 
Preferably, the condition-responsive regulatory element of 
said first exogenous expression cassette is different from the 
condition-responsive regulatory element of said second exog- 
enous expression cassette. For example, the condition-re- 
sponsive regulatory elements can comprise a differentiation- 
responsive regulatory element (e.g., a tissue or cell lineage 
specific promoter) and a drug-responsive regulatory element, 
such as a drug receptor, drug target, or drug signaling path- 
way-responsive regulatory element. 

[0008] Inasecond embodiment there is provided an in vitro 
set of cell lines comprising at least a first and second cell line, 
in certain aspects, for being able to simultaneously study 
differentiation, drug response or drug toxicity of various cell 
types in a large scale. In a particular aspect, the first and 
second lines each comprise an exogenous expression cassette 
comprising a selectable or screenable marker under the con- 
trol of a condition-responsive regulatory element. Preferably, 
the condition-responsive regulatory element of the exog- 
enous expression cassette of the first cell line is different from 
the condition-responsive regulatory element of the exog- 
enous expression cassette of the second cell line. For 
example, in certain aspects, the marker of the exogenous 
cassette in the first cell line is expressed only if the cell is ina 
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first differentiation state and the marker of the exogenous 
cassette in the second cell line is expressed only if the cell is 
in a second differentiation state wherein the first and second 
differentiation states are distinct. 

[0009] In certain aspects, cell lines according to the 
embodiments are pluripotent stem cells lines, such as induced 
pluripotent stem (iPS) cell lines. Particularly, the iPS cell 
lines may be essentially free of exogenous viral genetic ele- 
ments (e.g., free from exogenous retroviral elements), or even 
more particularly, induced pluripotent stem cells repro- 
grammed by exogenous episomal vectors, such as OriP-based 
vectors. The cell lines could also be somatic cell lines. In 
some aspects, an exogenous expression cassette is integrated 
into the genome of the pluripotent stem cell line(s). The cell 
line(s) may be, for example, human or mouse cells. 

[0010] As contemplated in the present invention, the cell 
lines according to the embodiments could include a wide 
variety of condition-responsive regulatory elements which 
control differential expression in a plurality of cell types. 
Thus, the cells could be used to follow the differential expres- 
sion of any condition-responsive regulatory element such as 
tissue-specific promoter in any developmental pathway. For 
example, a set of cell lines could comprise at least three, four, 
five, six, seven, nine, ten, 20, 30, 40, 50, 100, 1000, 10,000, 
20,000 (or any range derivable therein) different pluripotent 
stem cell lines, each comprising a different exogenous 
expression cassette having a different condition-responsive 
regulatory element. In a further aspect, cell lines could com- 
prise at least three, four, five, six, seven, nine, ten, 20, 30, 40, 
50, 100, 1000, 10,000, 20,000 (or any range derivable therein) 
different exogenous expression cassettes, each comprising a 
different condition-responsive regulatory element. At least 
two exogenous expression cassettes may be comprised in 
same cells, i.e., the exogenous expression cassette of a first or 
second cell line may comprise at least two separate exog- 
enous expression cassettes, each comprising a different con- 
dition-responsive regulatory element. 

[0011] For convenience to identify different cell lines, each 
pluripotent stem cell line of a set may be contained in a 
separate container different from other cell lines in the set of 
cell lines. In alternative aspects, two or more different pluri- 
potent stem cell line may be contained in the same container. 
[0012] Ectopic expression by means ofa defined condition- 
responsive regulatory element such as a promoter or enhancer 
sequence has the distinct advantage of allowing expression to 
be regulated in a known spatial and temporal fashion. The 
power of this aspect partly relies on a collection of condition- 
responsive regulatory elements, which could respond to 
endogenous or exogenous factors by controlling or regulating 
gene expression. 

[0013] Non-limiting examples of the condition-responsive 
regulatory elements include a differentiation-responsive pro- 
moter, such as a cell-specific promoter which causes expres- 
sion of a selectable or screenable marker when the pluripotent 
stem cell of the cell line differentiates to a selected cell lin- 
eage or a tissue-specific promoter. Condition-responsive 
regulatory elements can likewise comprise a drug-responsive 
regulatory element such as a promoter of a drug metabolizing 
enzyme, a signaling-responsive promoter which causes 
expression ofa selectable or screenable marker in a cell where 
aselected drug signaling pathway, drug target or a drug recep- 
tor (or a combination thereof) is activated or repressed. As 
used herein drug refers to a molecule including, but not lim- 
ited to, small molecules, nucleic acids and proteins or com- 
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binations thereof that alter or are candidates for altering a 
phenotype associated with disease. 

[0014] Particularly, the condition-responsive regulatory 
element may comprise a differentiation-specific promoter 
which causes expression of a selectable or screenable marker 
when the pluripotent stem cell ofthe cell line differentiates to 
a selected cell lineage or tissue type. Therefore, in certain 
aspects of the invention, each pluripotent stem cell line has a 
different differentiation-specific promoter that can be used to 
indicate status of differentiation into different cell types. 
[0015] In further aspects, a cell line that comprise cell- or 
tissue-specific expression cassettes (which causes expression 
of a selectable or screenable marker when the pluripotent 
stem cell differentiates to a selected cell lineage) may com- 
prise an additional exogenous expression cassette including a 
selectable or screenable marker under the control of an addi- 
tional condition-responsive regulatory element such as a 
drug-responsive regulatory element (e.g., a receptor, drug 
target, drug metabolizing enzyme or signaling pathway-re- 
sponsive element). Therefore, after differentiating these 
pluripotent stem cells to a selected cell lineage as indicated by 
expression of the marker gene under the control of a differ- 
entiation-responsive promoter, and optionally after selection 
of enrichment of desired differentiated cells, the additional 
expression cassette may be tested for drug response or sig- 
naling regulation of the desired differentiated cells. The addi- 
tional exogenous expression cassette may, in some aspects, be 
comprised in a transposon system, for example, a piggyBac 
system. 

[0016] In certain aspects, the differentiation-responsive 
promoter could be identified by a bioinformatics analysis of 
preferentially expressed genes in a selected cell lineage, for 
example, by transcriptome sequence analysis or genome 
analysis. Such bioinformatics analysis may involve the use of 
a data storage device configured to store the transcriptome or 
genome data, a server configured to query the potential pro- 
moter sequence, or a terminal configured to report the pro- 
moter analysis result. 

[0017] Any promoter that is known in the art to be a tissue- 
specific or cell-specific promoter as well as a promoter 
responsive to a compound or up-regulation or down-regula- 
tion ofa cell signaling could be used in aspects ofthe present 
invention, such as non-limiting examples listed in Table 1. For 
example, the promoter could be specific for a selected pro- 
genitor cell, such as a neural progenitor-specific promoter, a 
hematopoietic progenitor-specific promoter, a hepatocyte 
progenitor-specific promoter, or a cardiac progenitor-specific 
promoter. In other aspects, the promoter could be specific for 
a specific degree of differentiation or a selected terminally 
differentiated cell, such as a hepatocyte, a cardiomyocyte, an 
endothelial cell, or a neuron. In further aspects, the promoter 
could be specific for a selected terminally differentiated cell 
subtype, such as a ventricular cardiomyocyte, an atrial cardi- 
omyocyte, a nodal cardiomyocyte, an arterial endothelial cell, 
a venous endothelial cell, a lymphatic endothelial cell, a 
blood-brain barrier endothelial cell, a dopaminergic neuron, a 
cholinergic neuron, a gabaergic neuron, or a motor neuron. 
[0018] In an additional aspect, the differentiation-respon- 
sive regulatory element may comprise a tissue-specific pro- 
moter such as a kidney-specific promoter, a kidney medulla- 
specific promoter, a kidney cortex-specific promoter, a heart- 
specific promoter, a pan-cardiac promoter, a heart atria- 
specific promoter, a heart ventricle-specific promoter, a liver- 
specific promoter, a neural-specific promoter, a pancreas- 
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specific promoter, a lung-specific promoter, an endothelial- 
specific promoter, a blood-specific promoter, or an intestine- 
specific promoter. 

[0019] In a further aspect, an exogenous expression cas- 
sette can be inserted into the cell lines (e.g., a cell line in a set 
of lines), wherein the additional expression cassette comprise 
a drug-responsive regulatory element, such as a promoter of a 
drug metabolizing enzyme gene. The drug metabolizing 
enzyme gene may be a cytochrome P450 monooxygenase, 
N-acetyltransferase, thiopurine methyltransferase, or dihy- 
dropyrimidine dehydrogenase. For example, the cytochrome 
P450 monooxygenase may comprise CYP1A2, CYP2C9, 
CYP2C18, CYP2C19, CYP2D6, CYP2E1, CYP3AA, or any 
allelic variants thereof. This additional expression cassette 
can be operably linked to a marker gene so that any activity 
causing up or down regulation of the marker gene expression 
can be observed external to the cells. 

[0020] In still further aspects, an expression cassette may 
comprise a drug signaling-specific promoter which causes 
expression of a marker gene in a cell wherein a selected 
signaling pathway is up-regulated or down-regulated. Non- 
limiting examples of a selected drug signaling pathway 
include tyrosine kinase pathway, heterotrimeric G protein 
pathway, small GTPase pathway, serine/threonine protein 
kinase pathway, phosphatase pathway, lipid kinase pathway, 
hydrolase pathway, cyclic AMP (cAMP)-mediated pathway, 
cyclic GMP (cGMP)-mediated pathway, phosphatidylinosi- 
tol-triphosphate (PIP3)-mediated pathway, diacylglycerol 
(DAG)-mediated pathway, inositol-triphosphate (IP3 )-medi- 
ated pathway, EF hand domains of calmodulin-mediated sig- 
naling pathway, pleckstrin homology domains of the kinase 
protein AK T-mediated signaling pathway, chromatin regula- 
tion signaling pathway, MAPK signaling pathway, apoptosis/ 
autophagy pathway, translational control pathway, cell cycle/ 
checkpoint pathway, DNA damage pathway, Jak/Stat 
signaling pathway, NF-KB signaling pathway, TGF-B/Smad 
signaling pathway, lymphocyte signaling pathway, angiogen- 
esis pathway, vesicle trafficking pathway, cytoskeletal signal- 
ing pathway, adhesion pathway, glucose metabolism path- 
way, Wnt/Hedgehog/Notch signaling pathway, stem cell 
lineage specification pathway, nuclear receptor-mediated 
pathway, or protein folding and stability signaling pathway. 

[0021] In certain aspects, a cell line(s) according to the 
embodiments may comprise an additional exogenous expres- 
sion cassette including a selectable or screenable marker 
under the control of a condition-responsive regulatory ele- 
ment such as a drug-responsive regulatory element (e.g., a 
regulatory element responsive to a drug signaling pathway or 
from a drug metabolizing enzyme gene). For example, after 
selection or enrichment of differentiated cells with cell-spe- 
cific or tissue-specific expression ofa selectable or screenable 
marker, the additional exogenous expression cassette may be 
used to mark a pathway, a receptor or drug response in such 
differentiated cells with expression of a different selectable or 
screenable marker. 

[0022] For testing drug metabolism or biological response, 
one or more of the exogenous expression cassettes may fur- 
ther comprise coding sequence for expression of one or more 
cellular receptors, signaling pathway mediators, transcription 
factors, druggable targets or cytochrome P450 monooxyge- 
nase. 

[0023] In certain aspects, each exogenous expression cas- 
sette, or particularly, each cell-specific or tissue-specific 
expression cassette in different cells, could include the same 
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selectable or screenable marker, preferably comprised in the 
same gene delivery system, such as a recombination-medi- 
ated vector. Thus, since the cassettes could be similar with the 
only difference being the condition-responsive elements, the 
construction of the cassettes and their introduction into ali- 
quots ofthe same underlying cell line, for example, by recom- 
bination, and the assembly of the sets of the cell lines can be 
performed in parallel in large numbers. 

[0024] The selectable or screenable marker under the con- 
trol of different condition-responsive regulatory element 
could serve as status indicators of these regulatory elements 
and could aid selection or enrichment of cells that express 
such markers. For example, the selectable marker could be 
further defined as an antibiotic resistance gene, such as a gene 
that confers resistance to puromycin, blastocidin, geneticin, 
tetracycline, or ampicillin. 

[0025] The selectable marker may also be an exogenous 
antigenic epitope, particularly an exogenous surface antigen 
epitope, such as a mouse CD44 protein or epitope in the 
human cell lines. For example, the mouse CD44 protein could 
be used instead of the antibiotic resistance gene for selection 
or enrichment of desired cells by magnetic cell sorting (with 
an anti-mouse CD44 antibody) from the mixture. Obviously, 
one could use any ectopically expressed surface antigenic 
epitope as the selectable marker. 

[0026] In exemplary embodiments, the screenable marker 
may be a gene that expresses a cell surface marker, a fluores- 
cent, luminescent or bioluminescent protein, an epitope, 
chloramphenicol acetyl transferase (CAT), luciferase or 
p-galactosidase. For instance, the fluorescent protein could 
be a green fluorescent protein (GFP), red fluorescent protein 
(RFP), blue fluorescent protein (BFP) or yellow fluorescent 
protein (YFP), NFAT nitroreductase or a variant thereof. 
Depending on the markers used, the selection or enrichment 
of cells may comprise fluorescence-activated cell sorting 
(FACS), CAT assay, luminescence assay or any methods 
known for an ordinary person in the art to detect or screen for 
screenable marker expression, in order to select for cells 
differentiated in a selected cell lineage or in response to a 
selected condition. An alternative or complementary 
approach is to test the presence of exogenous transcripts 
corresponding to the screenable or selectable marker in prog- 
eny cells, using conventional methods, such as RT-PCR, in 
situ hybridization, RNA array, or hybridization (e.g., North- 
ern blot). In a particular aspect, one or more ofthe exogenous 
expression cassettes may include both a selectable and a 
screenable marker, preferably comprised in a polycistronic 
transcription unit. 

[0027] To co-express multiple genes under the same con- 
ditional-responsive regulatory element, the expression cas- 
sette may comprise a polycistronic transcription unit. Such a 
polycistronic transcription unit may comprise an internal 
ribosome entry site (IRES) or a sequence coding for at least 
one protease cleavage site and/or self-cleaving peptide for 
polycistronic transcription. For example, there are several 
self-cleaving peptides such as a viral 2A peptide. 

[0028] In further embodiments, there is provided a method 
for providing engineered pluripotent stem cells, comprising 
providing a stem cell line or set of lines of pluripotent stem 
cells according to the embodiments described above. The 
method may comprise introducing different exogenous 
expression cassettes into a single cell line or into respective 
different cells. For example, the exogenous expression cas- 
settes may be introduced into the cells by a gene delivery 
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system. The gene delivery system could be a vector. Non- 
limiting examples of a vector include a viral vector, an epi- 
somal vector, a transposon-based vector, or a recombinase- 
mediated cassette exchange vector. In particular, the vector is 
a recombinase-mediated cassette exchange vector. 


[0029] For expression of screenable or selectable markers 
across cell generations, one or more ofthe exogenous expres- 
sion cassettes could be integrated or comprised into the 
genome of the cells in certain aspects of the invention. For 
example, one or more of the exogenous expression cassettes 
may be integrated or comprised at a predetermined location or 
arandom location of the genome of the cells. Particularly, the 
predetermined location may be a Rosa26 locus of the genome 
of the cells. The Rosa26 locus may be a human Rosa26 locus, 
particularly a modified locus comprising an exogenous 
expression cassette as described above. Such an exogenous 
expression cassette may be particularly flanked by recombi- 
nation recognition sites for recombination-mediated 
exchange of cassettes into the Rosa26 locus. In further 
aspects, one or more cell lines of the set comprise an addi- 
tional exogenous expression cassette comprised in a transpo- 
son system, wherein the additional expression cassette is 
different from the exogenous expression cassette comprised 
in a Rosa locus of the same cell. 


[0030] For providing a useful set of pluripotent stem cells, 
there may also comprise a method including the steps of: (a) 
providing an in vitro set of cell lines of pluripotent stem cells 
comprising a condition-responsive exogenous expression 
cassette (e.g., a cassette under the control of a differentiation- 
responsive regulatory element that regulates cell- or tissue- 
specific expression); (b) providing one or more additional 
expression cassettes under the control of condition-respon- 
sive regulatory element, such as a drug-responsive regulatory 
element of a receptor, drug target, drug metabolizing enzyme 
or signaling pathway gene; and (c) introducing the one or 
more additional expression cassettes into the in vitro set of 
cell lines. In a specific aspect, the cell lines are induced 
pluripotent stem cell lines. Particularly, the iPS cell lines may 
be essentially free of exogenous retroviral genetic elements, 
or even more particularly, derived from episomal reprogram- 
ming. The cell lines may, in certain aspects, be human or 
mouse cells. 


[0031] For example, the cell-specific or tissue-specific 
exogenous expression cassettes may be comprised in the 
genome of pluripotent stem cells, particularly, a predeter- 
mined location of the genome of pluripotent stem cells, such 
as a Rosa26 locus. In further aspects, the cell-specific or 
tissue-specific exogenous expression cassettes may be intro- 
duced into the pluripotent stem cells by a gene delivery sys- 
tem, such as a recombination-mediated cassette exchange 
vector. In other aspects, additional expression cassettes may 
be introduced into pluripotent stem cells by a transposon 
system, such as a piggyBac transposon system. The cell- 
specific or tissue-specific exogenous expression cassettes or 
the additional expression cassettes may comprise a marker 
gene under the control of respective condition-responsive 
regulatory elements. Such a marker gene may be a selectable 
marker, screenable marker, or a combination thereof. In some 
aspects, the marker gene for all the cell-specific or tissue- 
specific exogenous expression cassettes are the same. In cer- 
tain aspects, the marker gene for all the additional expression 
cassettes are the same. To serve different purposes in some 
aspects, in the same cell line the marker gene of the cell- 
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specific or tissue-specific exogenous expression cassette may 
be different from that of the additional expression cassette. 
[0032] In a further aspect, there may also be provided a 
method of providing differentiated cells, comprising the steps 
of: (a) providing an in vitro set of stem cell lines of pluripotent 
stem cells described above or pluripotent stem cells provided 
in accordance with method described above, wherein the 
pluripotent stem cells comprise a cell-specific or tissue-spe- 
cific exogenous expression cassette comprising a selectable 
or screenable marker under the control of a cell- or tissue- 
specific regulatory element; and (b) culturing the pluripotent 
stem cells under a condition to differentiate the pluripotent 
stem cells, therefore providing differentiated cells. 

[0033] If the tissue-specific, cell-specific or molecular 
(e.g., drug) pathway-specific promoter in a pluripotent stem 
cell line is activated in the stem cell or differentiated daughter 
cells, either during differentiation or in the terminal differen- 
tiation state, the selectable or screenable marker could be 
expressed and can then be detected or measured. Therefore, in 
certain aspects, the differentiation method may further com- 
prise selecting or enriching differentiated cells which express 
the selectable or screenable marker under the control of the 
cell- or tissue-specific regulatory element. The selection or 
enrichment may comprise a high-throughput purification, 
screening or imaging. For example, the selection or enrich- 
ment comprises fluorescence-activated cell sorting (FACS), 
chloramphenicol acetyltransferase (CAT) assay, or lumines- 
cence assay. 

[0034] In certain aspects, there may be provided a method 
comprising testing an effect of a test compound on the differ- 
entiated cells. For example, the test compound is a small 
molecule drug, a nucleic acid, or a peptide. Such differenti- 
ated cells used in this aspect comprise exogenous expression 
cassettes which include a selectable or screenable marker 
under the control of a regulatory element responsive to a drug 
response or signaling pathway activation, alone or in combi- 
nation with exogenous expression cassettes having cell-spe- 
cific or tissue-specific regulatory elements. 

[0035] In a particular aspect, each different exogenous 
expression cassette includes a selectable or screenable 
marker under the control of a promoter of a different drug 
metabolizing enzyme gene, such as all the variants of a P450 
gene. For example, when differentiated to hepatocytes, which 
could be selected or enriched using the expression of a select- 
able or screenable marker under the control of a hepatocyte- 
specific promoter, the differentiated cells could then be tested 
for the spectrum of drug response such as P450 responses. 
[0036] Ina further aspect, there may be provided a method 
of testing a differentiation condition, comprising the steps of: 
(a) providing a pluripotent stem cell comprising an exog- 
enous expression cassette including a selectable or screenable 
marker under the control ofa condition-responsive regulatory 
element which causes expression of the selectable or screen- 
able marker when the pluripotent stem cell differentiates to a 
selected cell lineage or tissue; and (b) culturing the pluripo- 
tent stem cell under a test condition and determining whether 
the test condition differentiate the pluripotent stem cell to a 
selected cell lineage or tissue, wherein if differentiated to the 
selected cell lineage or tissue, progeny cells of the pluripotent 
stem cell express the selectable or screenable marker. This 
method could be used to screen novel conditions that could be 
used to provide cells ofa cell type by differentiation. The test 
condition may be a drug, a peptide, a nucleic acid, or a culture 
condition. This method may also be used for other aspects of 


Aug. 6, 2015 


programming, like transdifferentiation or dedifferentiation in 
a similar manner. The method may further comprise further 
comprising selecting or enriching differentiated cells which 
express the selectable or screenable marker under the control 
of the condition-responsive regulatory element. 

[0037] There may comprise a method of testing a com- 
pound for its effect on differentiation of specific cell or tissue 
types, comprising the steps of: (a) providing a pluripotent 
stem cell comprising an exogenous expression cassette 
including a selectable or screenable marker under the control 
of a condition-responsive regulatory element which causes 
expression of the selectable or screenable marker when the 
pluripotent stem cell differentiates to a selected celllineage or 
tissue; and (b) culturing the pluripotent stem cell under a 
differentiation condition in the presence of a test compound, 
wherein the differentiation condition is capable of differenti- 
ating pluripotent stem cells into the selected cell lineage or 
tissue per se; and c) determining the expression of the select- 
able or screenable marker for the effect of the testing com- 
pound on the differentiation of the pluripotent stem cell to the 
selected cell lineage or tissue. 

[0038] In a further aspect a method of testing a compound 
(e.g., a drug) is provided, comprising the steps of: (a) provid- 
ing a pluripotent stem cell comprising (1) a first exogenous 
expression cassette comprising a selectable or screenable 
marker under the control ofa differentiation-responsive regu- 
latory element and (ii) a second exogenous expression cas- 
sette comprising a screenable marker under the control of a 
drug-responsive regulatory element, and culturing the cell 
under differentiation conditions sufficient to cause expression 
of the first expression cassette; (b) contacting the cell with a 
drug; and (c) determining a response to the drug by determin- 
ing expression of the second expression cassette. In certain 
aspects, such a method may comprise testing a plurality of 
compounds such as at least about 10, 100, 1,0000, 10,000 or 
more compounds. 

[0039] Embodiments discussed in the context of methods 
and/or compositions of the invention may be employed with 
respect to any other method or composition described herein. 
Thus, an embodiment pertaining to one method or composi- 
tion may be applied to other methods and compositions ofthe 
invention as well. 

[0040] As used herein the terms “encode” or “encoding” 
with reference to a nucleic acid are used to make the invention 
readily understandable by the skilled artisan however these 
terms may be used interchangeably with “comprise” or “com- 
prising” respectively. 

[0041] As used herein the specification, “a” or “an” may 
mean one or more. As used herein in the claim(s), when used 
in conjunction with the word “comprising”, the words “a” or 
"an" may mean one or more than one. 

[0042] The use of the term “or” in the claims is used to 
mean “and/or” unless explicitly indicated to refer to alterna- 
tives only or the alternatives are mutually exclusive, although 
the disclosure supports a definition that refers to only alter- 
natives and “and/or.” As used herein “another” may mean at 
least a second or more. 

[0043] Throughout this application, the term “about” is 
used to indicate that a value includes the inherent variation of 
error for the device, the method being employed to determine 
the value, or the variation that exists among the study sub- 
jects. 

[0044] Other objects, features and advantages of the 
present invention will become apparent from the following 
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detailed description. It should be understood, however, that 
the detailed description and the specific examples, while indi- 
cating preferred embodiments of the invention, are given by 
way of illustration only, since various changes and modifica- 
tions within the spirit and scope of the invention will become 
apparent to those skilled in the art from this detailed descrip- 
tion. 


BRIEF DESCRIPTION OF THE DRAWINGS 


[0045] The following drawings form part of the present 
specification and are included to further demonstrate certain 
aspects of the present invention. The invention may be better 
understood by reference to one or more of these drawings in 
combination with the detailed description of specific embodi- 
ments presented herein. 

[0046] FIG. 1. An illustrative exemplary embodiment of a 
Rosa26 targeting cassette (upper) and a cassette-exchanged 
hepatocyte selectable line (lower) Upper: inserted in 
between intron I and II in the native Rosa 26 locus on human 
chromosome 3, the Rosa 26 targeting cassette included, in 5' 
to 3' sequence, a 5' homologous arm for targeting, a spacer, a 
recombinase recognition site (white triangle), a protein cod- 
ing sequence from the thymidine kinase gene beginning with 
an ATG to start transcription, a 2A sequence, a second protein 
coding sequence for an antibiotic resistance gene for resis- 
tance to neomycin, a second recombinase recognition site 
(black triangle) and a 3' homologous arm. Lower: the ele- 
ments of an exogenous genetic construct in a secondary engi- 
neered iPS line constructed for selection of hepatocytes are 
shown. The secondary iPS line is made from the basal Rosa 
26 iPS line comprising the Rosa26 targeting cassette. The 
genetic construct was assembled which contained two 
expression cassettes, one cassette to permit selection of the 
desired recombinant event, and one cassette to permit tissue 
specific selection of the desired tissue type, i.e., hepatocytes. 
At the 5' end of the construct, there was the left recombination 
recognition site, followed by the protein coding sequence for 
another antibiotic resistance, designed herein as the iPS selec- 
tor. This coding sequence is driven by the native Rosa 26 
promoter to permit successful desired recombinant cells to be 
identified by resistance to the antibiotic for which the iPS 
selector confers resistance. Also in the construct, oriented in 
the opposite direction, is a construct including the promoter 
of alpha-1-antitrypsin (pAAT), which drives the expression 
of a second antibiotic selection gene, this one to be used to 
select cells when the cells have differentiated into hepato- 
cytes. In this particular construct, there are also several 
enhancer elements (designated as ApoET-4). 

[0047] FIG. 2 is an example of the common format of 
design of the genetic constructions to go into the iPS lines of 
the collection. For each insertion, there is an iPS selector 
which permits selection ofthe desired recombinant insertion. 
For each insertion, there is a tissue specific promoter, the 
promoters being different in different elements of the set, but 
each of the promoters selected for tissue specific expression. 
The tissue specific expression will be in some instances an 
organ, e.g., pan cardiac, in some instances an organ subtype, 
e.g., atrial cell, in some cases a body wide cell type, e.g., 
endothelial cell, or in some instances a level of differentiation, 
e.g., a cardiac progenitor. The tissue specific promoter actu- 
ates expression of a second gene for resistance to a second 
antibiotic resistance gene, labeled a cell type selector. A 
marker gene, such as a fluorescent protein, luciferase, a pro- 
prietary marker system, such as HaloTag or SNAP, is linked in 


Aug. 6, 2015 


expression to the cell type selector by a 2A linker, which 
works to express two distinct proteins driven by a common 
promoter. 


DESCRIPTION OF ILLUSTRATIVE 
EMBODIMENTS 


[0048] The instant invention overcomes several major 
problems with current technologies by providing methods 
and compositions related to engineered pluripotent stem cells 
that can be used to study any biological response in the human 
body. These engineered pluripotent cells provide a tool kit 
with a wide range of applications not adequately addressed by 
current technology. For example, cells are provided compris- 
ing two or more exogenous expression cassettes each com- 
prising a selectable or screenable marker under the control of 
different condition-responsive regulatory elements. These 
cells allow for simultaneous testing of two or more different 
conditions that are interrogated by the expression of screen- 
able or selectable markers from the cassettes. Thus, the engi- 
neered cells allow for rapid assessment of, for example, effi- 
cacy and toxicity of new drug candidates. In further aspects, 
the cells may be employ to develop and optimize directed cell 
differentiation protocols. 


[0049] In one example, a pluripotent stem cell according to 
the embodiments comprises at least two exogenous expres- 
sion cassettes. The first cassette includes a condition-respon- 
sive regulatory element that provides expression in response 
to a compound of interest, such as a drug candidate. The 
second cassette then includes a condition-responsive regula- 
tory element providing expression when the cell differenti- 
ates into a selected cell lineage. Accordingly, the cell line can 
be used to test cellular response to a drug candidate by cul- 
turing the cell line to differentiate cells into the lineage of 
interest. Expression from the second cassette is thus indica- 
tive of differentiation into the lineage of interest. Differenti- 
ated cells are then contacted with the drug candidate and 
expression from the first cassette is determined to assess 
response to the drug candidate. Alternatively or additionally, 
cells can be selected based on expression from the second 
cassette to provide an essentially pure population of cells ofa 
lineage of interest for testing of the drug a candidate. Thus, 
the cells provide a lineage-specific readout of the effects of 
candidate drug molecules. 


[0050] Ina related aspect, a pluripotent cell line according 
to the embodiments comprises a first exogenous expression 
cassette with a condition-responsive regulatory element that 
provides expression in response to a compound of interest 
(e.g., a drug candidate). From this cell line a panel of lines can 
be generated, each comprising at least a second exogenous 
expression cassette (e.g., a cassette comprised in a transposon 
system) with a condition-responsive regulatory element that 
is active only when cells differentiate into a selected lineage 
or cell type of interest. Thus, cell lines in the panel can be 
differentiated into an array of cell lineages. In each case, the 
differentiation status of the cells can be confirmed, or the cells 
selected, based on expression from the second expression 
cassette. The effect of a compound of interest can thereby be 
determined on a whole range of different cell lineages by 
contacting the differentiated cells with the compound and 
detecting expression of the first expression cassette. In this 
case, each cell line in the panel is able to provide information 
regarding a different differentiated cell type. As a whole, such 
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a panel could provide information on drug effect and/or tox- 
icity for essentially all ofthe cell lineages in an organ or tissue 
of interest. 

[0051] Conversely, a panel of cells can be generated using 
as a base a cell line comprising an expression cassette that 
provides expression a particular cell lineage. A panel of such 
cells is then generated with each line comprising a further 
expression cassette that provides expression upon activation a 
pathway of interest. Such cells can then be differentiated to 
the cell lineage of interest, as confirmed (or selected) by 
expression from the base expression cassette. The various 
differentiated cells in the panel are then treated with a drug or 
a panel of drugs and expression from the further expression 
cassettes is assessed to determine the effect of the drug(s) on 
a range of different metabolic pathways. In this example, an 
array of metabolic pathways in a particular cell type can be 
assayed simultaneously to provide a complete picture of the 
effect of a drug candidate or panel of candidates on the par- 
ticular cell type. 

[0052] The engineered pluripotent stem cells and panels of 
stem cells of the embodiments thus provide a highly adapt- 
able, high throughput system, for interrogating cellular 
response in any type cell or tissue and at virtually any stage of 
differentiation. The cells can be used, for example, to simul- 
taneously test the effect of drug candidates on a plurality of 
metabolic pathways and/or in a plurality of different cell 
types. Likewise the engineered cells can be used to test and 
refine differentiation conditions for producing cells types of 
interest. In this case, the ability to simultaneously integrate 
the appearance of multiple cell lineages in a population 
allows differentiation conditions to be refine to either elimi- 
nate undesirable cell lineages or to enhance the proportion of 
a lineage of interest. Moreover these cells can be used to 
develop differentiation protocols that provide a population of 
differentiated cells having a desired proportion of different 
cell types. The engineered cells, thus, constitute a new and 
powerful tool to address lineage specific differentiation and 
cellular response that was not previously available. 

[0053] Further embodiments and advantages of the inven- 
tion are described below. 


II. DEFINITIONS 


[0054] “Programming” is a process that changes a cell to 
form progeny of at least one new cell type, either in culture or 
in vivo, than it would have under the same conditions without 
programming. This means that after sufficient proliferation, a 
measurable proportion of progeny having phenotypic char- 
acteristics of the new cell type if essentially no such progeny 
could form before programming; alternatively, the proportion 
having characteristics of the new cell type is measurably more 
than before programming. This process includes differentia- 
tion, dedifferentiation and transdifferentiation. “Differentia- 
tion” is the process by which a less specialized cell becomes 
a more specialized cell type. “Dedifferentiation” is a cellular 
process in which a partially or terminally differentiated cell 
reverts to an earlier developmental stage, such as pluripo- 
tency or multipotency. “Transdifferentiation” is a process of 
transforming one differentiated cell type into another differ- 
entiated cell type. Under certain conditions, the proportion of 
progeny with characteristics of the new cell type may be at 
least about 1%, 5%, 25% or more in the order of increasing 
preference. 

[0055] “Reprogramming” is a process that confers on a cell 
a measurably increased capacity to form progeny of at least 
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one new cell type, either in culture or in vivo, than it would 
have under the same conditions without reprogramming. 
Dedifferentiation may include reprogramming. More specifi- 
cally, reprogramming is a process that confers on a somatic 
cell a pluripotent potential. This means that after sufficient 
proliferation, a measurable proportion of progeny having 
phenotypic characteristics of the new cell type if essentially 
no such progeny could form before reprogramming; other- 
wise, the proportion having characteristics of the new cell 
type is measurably more than before reprogramming. Under 
certain conditions, the proportion of progeny with character- 
istics ofthe new cell type may be at least about 1%, 5%, 2596 
or more in the order of increasing preference. 

[0056] The term “exogenous,” when used in relation to a 
protein, gene, nucleic acid, or polynucleotide in a cell or 
organism refers to a protein, gene, nucleic acid, or polynucle- 
otide which has been introduced into the cell or organism by 
artificial means, or in relation to a cell refers to a cell which 
was isolated and subsequently introduced to other cells or to 
an organism by artificial means. An exogenous nucleic acid 
may be from a different organism or cell, or it may be one or 
more additional copies of a nucleic acid which occurs natu- 
rally within the organism or cell. An exogenous cell may be 
from a different organism, or it may be from the same organ- 
ism. By way of a non-limiting example, an exogenous nucleic 
acid is in a chromosomal location different from that of natu- 
ral cells, or is otherwise flanked by a different nucleic acid 
sequence than that found in nature. 

[0057] By “expression construct" or “expression cassette” 
is meant a nucleic acid molecule that is capable of directing 
transcription. An expression construct includes, at the least, 
one or more transcriptional control elements (such as promot- 
ers, enhancers or a structure functionally equivalent thereof) 
that direct gene expression in one or more desired cell types, 
tissues or organs. Additional elements, such as a transcription 
termination signal, may also be included. 

[0058] A “vector” or “construct” (sometimes referred to as 
gene delivery system or gene transfer “vehicle”) refers to a 
macromolecule or complex of molecules comprising a poly- 
nucleotide to be delivered to a host cell, either in vitro or in 
vivo. 

[0059] A “plasmid”, a common type ofa vector, is an extra- 
chromosomal DNA molecule separate from the chromo- 
somal DNA which is capable of replicating independently of 
the chromosomal DNA. In certain cases, it is circular and 
double-stranded. 

[0060] The term “corresponds to” is used herein to mean 
that a polynucleotide sequence is homologous (i.e., is identi- 
cal, not strictly evolutionarily related) to all or a portion of a 
reference polynucleotide sequence, or that a polypeptide 
sequence is identical to a reference polypeptide sequence. In 
contradistinction, the term “complementary to” is used herein 
to mean that the complementary sequence is homologous to 
all or a portion of a reference polynucleotide sequence. For 
illustration, the nucleotide sequence *TATAC" corresponds 
to a reference sequence “TATAC” and is complementary to a 
reference sequence “GTATA”. 

[0061] A "gene" “polynucleotide,” “coding region,” 
“sequence,” “segment,” “fragment,” or “transgene” which 
“encodes” a particular protein, is a nucleic acid molecule 
which is transcribed and optionally also translated into a gene 
product, e.g., a polypeptide, in vitro or in vivo when placed 
under the control of appropriate regulatory sequences. The 
coding region may be present in either a cDNA, genomic 
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DNA, or RNA form. When present in a DNA form, the 
nucleic acid molecule may be single-stranded (1.e., the sense 
strand) or double-stranded. The boundaries ofa coding region 
are determined by a start codon at the 5' (amino) terminus and 
a translation stop codon at the 3' (carboxy) terminus. A gene 
can include, but is not limited to, cDNA from prokaryotic or 
eukaryotic mRNA, genomic DNA sequences from prokary- 
otic or eukaryotic DNA, and synthetic DNA sequences. A 
transcription termination sequence will usually be located 3' 
to the gene sequence. 

[0062] The term “control elements” refers collectively to 
promoter regions, polyadenylation signals, transcription ter- 
mination sequences, upstream regulatory domains, origins of 
replication, internal ribosome entry sites (“IRES”), enhanc- 
ers, splice junctions, and the like, which collectively provide 
for the replication, transcription, post-transcriptional pro- 
cessing and translation of a coding sequence in a recipient 
cell. Not all of these control elements need always be present 
so long as the selected coding sequence is capable of being 
replicated, transcribed and translated in an appropriate host 
cell. 

[0063] The term “promoter” is used herein in its ordinary 
sense to refer to a nucleotide region comprising a DNA regu- 
latory sequence, wherein the regulatory sequence is derived 
from a gene which is capable of binding RNA polymerase and 
initiating transcription of a downstream (3' direction) coding 
sequence. 

[0064] By “enhancer” is meant a nucleic acid sequence 
that, when positioned proximate to a promoter, confers 
increased transcription activity relative to the transcription 
activity resulting from the promoter in the absence of the 
enhancer domain. 

[0065] By “operably linked" with reference to nucleic acid 
molecules is meant that two or more nucleic acid molecules 
(e.g., a nucleic acid molecule to be transcribed, a promoter, 
and an enhancer element) are connected in such a way as to 
permit transcription of the nucleic acid molecule. “Operably 
linked” with reference to peptide and/or polypeptide mol- 
ecules is meant that two or more peptide and/or polypeptide 
molecules are connected in such a way as to yield a single 
polypeptide chain, i.e., a fusion polypeptide, having at least 
one property of each peptide and/or polypeptide component 
of the fusion. The fusion polypeptide is preferably chimeric, 
i.e., composed of heterologous molecules. 

[0066] “Homology” refers to the percent of identity 
between two polynucleotides or two polypeptides. The cor- 
respondence between one sequence and another can be deter- 
mined by techniques known in the art. For example, homol- 
ogy can be determined by a direct comparison of the sequence 
information between two polypeptide molecules by aligning 
the sequence information and using readily available com- 
puter programs. Alternatively, homology can be determined 
by hybridization of polynucleotides under conditions which 
form stable duplexes between homologous regions, followed 
by digestion with single strand-specific nuclease(s), and size 
determination of the digested fragments. Two DNA, or two 
polypeptide, sequences are “substantially homologous” to 
each other when at least about 80%, preferably at least about 
90%, and most preferably at least about 95% of the nucle- 
otides, or amino acids, respectively match over a defined 
length of the molecules, as determined using the methods 
above. 

[0067] Theterm “cell” is herein used in its broadest sense in 
the art and refers to a living body which is a structural unit of 
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tissue of a multicellular organism, is surrounded by a mem- 
brane structure which isolates it from the outside, has the 
capability of self replicating, and has genetic information and 
a mechanism for expressing it. Cells used herein may be 
naturally-occurring cells, synthetic cells, or artificially modi- 
fied cells (e.g., fusion cells, genetically modified cells, etc.). 
[0068] As used herein, the term “stem cell" refers to a cell 
capable of giving rising to at least one type ofa more special- 
ized cell. A stem cell has the ability to self-renew, i.e., to go 
through numerous cycles of cell division while maintaining 
the undifferentiated state, and has potency, i.e., the capacity to 
differentiate into specialized cell types. Typically, stem cells 
can regenerate an injured tissue. Stem cells herein may be, but 
are not limited to, embryonic stem (ES) cells, induced pluri- 
potent stem (iPS) cells, or tissue stem cells (also called tissue- 
specific stem cell, or somatic stem cell). Any artificially pro- 
duced cell which can have the above-described abilities (e.g., 
fusion cells, reprogrammed cells, orthe like used herein) may 
be a stem cell. 

[0069] “Embryonic stem (ES) cells" are pluripotent stem 
cells derived from early embryos. An ES cell was first estab- 
lished in 1981, which has also been applied to production of 
knockout mice since 1989. In 1998, a human ES cell was 
established, which is currently becoming available for regen- 
erative medicine. 

[0070] Unlike ES cells, tissue stem cells have a limited 
differentiation potential. Tissue stem cells are present at par- 
ticular locations in tissues and have an undifferentiated intra- 
cellular structure. Therefore, the pluripotency of tissue stem 
cells is typically low. Tissue stem cells have a higher nucleus/ 
cytoplasm ratio and have few intracellular organelles. Most 
tissue stem cells have low pluripotency, a long cell cycle, and 
proliferative ability beyond the life of the individual. Tissue 
stem cells are separated into categories, based on the sites 
from which the cells are derived, such as the dermal system, 
the digestive system, the bone marrow system, the nervous 
system, and the like. Tissue stem cells in the dermal system 
include epidermal stem cells, hair follicle stem cells, and the 
like. Tissue stem cells in the digestive system include pancre- 
atic (common) stem cells, liver stem cells, and the like. Tissue 
stem cells in the bone marrow system include hematopoietic 
stem cells, mesenchymal stem cells, and the like. Tissue stem 
cells in the nervous system include neural stem cells, retinal 
stem cells, and the like. 

[0071] “Induced pluripotent stem cells,’ commonly abbre- 
viated as iPS cells or iPSCs, refer to a type of pluripotent stem 
cell artificially prepared from a non-pluripotent cell, typically 
an adult somatic cell, or terminally differentiated cell, such as 
fibroblast, a hematopoietic cell, a myocyte, a neuron, an epi- 
dermal cell, or the like, by inserting certain genes, referred to 
as reprogramming factors. 

[0072] “Pluripotency” refers to a stem cell that has the 
potential to differentiate into all cells constituting one or more 
tissues or organs, or preferably, any of the three germ layers: 
endoderm (interior stomach lining, gastrointestinal tract, the 
lungs), mesoderm (muscle, bone, blood, urogenital), or ecto- 
derm (epidermal tissues and nervous system). “Pluripotent 
stem cells” used herein refer to cells that can differentiate into 
cells derived from any of the three germ layers, for example, 
direct descendants of totipotent cells or induced pluripotent 
cells. 

[0073] As used herein “totipotent stem cells" refers to cells 
has the ability to differentiate into all cells constituting an 
organism, such as cells that are produced from the fusion ofan 
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egg and sperm cell. Cells produced by the first few divisions 
of the fertilized egg are also totipotent. These cells can dif- 
ferentiate into embryonic and extraembryonic cell types. 
Pluripotent stem cells can give rise to any fetal or adult cell 
type. However, alone they cannot develop into a fetal or adult 
animal because they lack the potential to contribute to 
extraembryonic tissue, such as the placenta. 

[0074] In contrast, many progenitor cells are multipotent 
stem cells, i.e., they are capable of differentiating into a 
limited number of cell fates. Multipotent progenitor cells can 
give rise to several other cell types, but those types are limited 
innumber. An example of a multipotent stem cell is a hemato- 
poietic cell—a blood stem cell that can develop into several 
types of blood cells, but cannot develop into brain cells or 
other types of cells. At the end of the long series of cell 
divisions that form the embryo are cells that are terminally 
differentiated, or that are considered to be permanently com- 
mitted to a specific function. 

[0075] As used herein, the term “somatic cell” refers to any 
cell other than germ cells, such as an egg, a sperm, or the like, 
which does not directly transfer its DNA to the next genera- 
tion. Typically, somatic cells have limited or no pluripotency. 
Somatic cells used herein may be naturally-occurring, syn- 
thetic, or genetically modified. 


III. SOURCES OF CELLS 


[0076] In certain embodiments of the invention, there are 
disclosed methods and compositions for providing an in vitro 
set of cell lines comprising stem cells or differentiated cells 
that comprise different exogenous expression cassettes. In 
some embodiments, the cells may be stem cells, including but 
are not limited to, embryonic stem cells, fetal stem cells, or 
adult stem cells. In further embodiments, the cells may be any 
somatic cells. Thus, it will be recognized that, in certain 
aspects, cell lines according to the embodiments are made 
without destruction of human embryos. 

[0077] B. Stem Cells 

[0078] Stem cells are cells found in most, if not all, multi- 
cellular organisms. They are characterized by the ability to 
renew themselves through mitotic cell division and differen- 
tiating into a diverse range of specialized cell types. The two 
broad types of mammalian stem cells are: embryonic stem 
cells that are found in blastocysts, and adult stem cells that are 
found in adult tissues. In a developing embryo, stem cells can 
differentiate into all of the specialized embryonic tissues. In 
adult organisms, stem cells and progenitor cells act as a repair 
system for the body, replenishing specialized cells, but also 
maintain the normal turnover of regenerative organs, such as 
blood, skin or intestinal tissues. 

[0079] Human embryonic stem cells (ESCs) and induced 
pluripotent stem cells (PSC) are capable of long-term prolif- 
eration in vitro, while retaining the potential to differentiate 
into all cell types of the body, including hepatocytes. Thus 
these cells could potentially provide an unlimited supply of 
various patient-specific differentiated cells such as functional 
hepatocytes for research, drug development and transplanta- 
tion therapies. 

[0080] 2. Embryonic Stem Cells 

[0081] Embryonic stem cell lines (ES cell lines) are cul- 
tures of cells derived from the epiblast tissue of the inner cell 
mass (ICM) of a blastocyst or earlier morula stage embryos. 
A blastocyst is an early stage embryo—approximately four to 
five days old in humans and consisting of 50-150 cells. ES 
cells are pluripotent and give rise during development to all 
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derivatives of the three primary germ layers: ectoderm, endo- 
derm and mesoderm. In other words, they can develop into 
each of the more than 200 cell types of the adult body when 
given sufficient and necessary stimulation for a specific cell 
type. They do not contribute to the extraembryonic mem- 
branes or the placenta. 

[0082] Nearly all research to date has taken place using 
mouse embryonic stem cells (mES) or human embryonic 
stem cells (hES). Both have the essential stem cell character- 
istics, yet they require very different environments in order to 
maintain an undifferentiated state. Mouse ES cells may be 
grown on a layer of gelatin and require the presence of Leu- 
kemia Inhibitory Factor (LIF). Human ES cells could be 
grown on a feeder layer of mouse embryonic fibroblasts 
(MEFs) and often require the presence of basic Fibroblast 
Growth Factor (bFGF or FGF-2). Without optimal culture 
conditions or genetic manipulation (Chambers et al., 2003), 
embryonic stem cells will rapidly differentiate. 

[0083] A human embryonic stem cell may be also defined 
by the presence of several transcription factors and cell sur- 
face proteins. The transcription factors Oct-4, Nanog, and 
Sox-2 form the core regulatory network that ensures the sup- 
pression of genes that lead to differentiation and the mainte- 
nance of pluripotency (Boyer et al., 2005). The cell surface 
antigens most commonly used to identify pluripotent stem 
cells include the glycolipids SSEA3 and SSEA4 and the 
keratan sulfate antigens Tra-1-60 and Tra-1-81. 

[0084] Methods for obtaining mouse ES cells are well 
known. In one method, a preimplantation blastocyst from the 
129 strain of mice is treated with mouse antiserum to remove 
the trophoectoderm, and the inner cell mass is cultured on a 
feeder cell layer of chemically inactivated mouse embryonic 
fibroblasts in medium containing fetal calf serum. Colonies of 
undifferentiated ES cells that develop are subcultured on 
mouse embryonic fibroblast feeder layers in the presence of 
fetal calf serum to produce populations of ES cells. In some 
methods, mouse ES cells can be grown in the absence of a 
feeder layer by adding the cytokine leukemia inhibitory factor 
(LIF) to serum-containing culture medium (Smith, 2000). In 
other methods, mouse ES cells can be grown in serum-free 
medium in the presence of bone morphogenetic protein and 
LIF (Ying et al., 2003). 

[0085] Human ES cells can be obtained from blastocysts 
using previously described methods (Thomson et al., 1995; 
Thomson et al., 1998; Thomson and Marshall, 1998; Reu- 
binoff et al, 2000.) In one method, day-5 human blastocysts 
are exposed to rabbit anti-human spleen cell antiserum, then 
exposed to a 1:5 dilution of Guinea pig complement to lyse 
trophectoderm cells. After removing the lysed trophectoderm 
cells from the intact inner cell mass, the inner cell mass is 
cultured on a feeder layer of gamma-inactivated mouse 
embryonic fibroblasts and in the presence of fetal bovine 
serum. After 9 to 15 days, clumps of cells derived from the 
inner cell mass can be chemically (1.e., exposed to trypsin) or 
mechanically dissociated and replated in fresh medium con- 
taining fetal bovine serum and a feeder layer of mouse embry- 
onic fibroblasts. Upon further proliferation, colonies having 
undifferentiated morphology are selected by micropipette, 
mechanically dissociated into clumps, and replated (see U.S. 
Pat. No. 6,833,269). ES-like morphology is characterized as 
compact colonies with apparently high nucleus to cytoplasm 
ratio and prominent nucleoli. Resulting ES cells can be rou- 
tinely passaged by brief trypsinization or by selection of 
individual colonies by micropipette. In some methods, 
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human ES cells can be grown without serum by culturing the 
ES cells on a feeder layer of fibroblasts in the presence of 
basic fibroblast growth factor (Amit et al., 2000). In other 
methods, human ES cells can be grown without a feeder cell 
layer by culturing the cells on a protein matrix such as Matri- 
gel™ or laminin in the presence of “conditioned” medium 
containing basic fibroblast growth factor (Xu et al., 2001). 
The medium is previously conditioned by coculturing with 
fibroblasts. 

[0086] Methods for the isolation of rhesus monkey and 
common marmoset ES cells are also known (Thomson, and 
Marshall, 1998; Thomson et al., 1995; Thomson and Odorico, 
2000). 

[0087] Another source of ES cells are established ES cell 
lines. Various mouse cell lines and human ES cell lines are 
known and conditions for their growth and propagation have 
been defined. For example, the mouse CGR8 cell line was 
established from the inner cell mass of mouse strain 129 
embryos, and cultures of CGR8 cells can be grown in the 
presence of LIF without feeder layers. As a further example, 
human ES cell lines H1, H7, H9, H13 and H14 were estab- 
lished by Thompson et al. In addition, subclones H9.1 and 
H9.2 of the H9 line have been developed. It is anticipated that 
virtually any ES or stem cell line known in the art and may be 
used with the present invention, such as, e.g., those described 
in Yu and Thompson, 2008, which is incorporated herein by 
reference. 

[0088] The source of ES cells for use in connection with the 
present invention can be a blastocyst, cells derived from cul- 
turing the inner cell mass of a blastocyst, or cells obtained 
from cultures of established cell lines. Thus, as used herein, 
the term “ES cells" can refer to inner cell mass cells of a 
blastocyst, ES cells obtained from cultures of inner mass 
cells, and ES cells obtained from cultures of ES cell lines. 
[0089] 3. Induced Pluripotent Stem Cells 

[0090] Induced pluripotent stem (iPS) cells are cells which 
have the characteristics of ES cells but are obtained by the 
reprogramming of differentiated somatic cells. Induced pluri- 
potent stem cells have been obtained by various methods. In 
one method, adult human dermal fibroblasts are transformed 
with transcription factors Oct4, Sox2, c-Myc and KITA using 
retroviral transduction (Takahashi et al., 2007). The trans- 
formed cells are plated on SNL feeder cells (a mouse cell 
fibroblast cell line that produces LIF) in medium supple- 
mented with basic fibroblast growth factor (bFGF). After 
approximately 25 days, colonies resembling human ES cell 
colonies appear in culture. The ES cell-like colonies are 
picked and expanded on feeder cells in the presence of bFGF. 
[0091] Based on cell characteristics, cells of the ES cell- 
like colonies are induced pluripotent stem cells. The induced 
pluripotent stem cells are morphologically similar to human 
ES cells, and express various human ES cell markers. Also, 
when grown under conditions that are known to result in 
differentiation of human ES cells, the induced pluripotent 
stem cells differentiate accordingly. For example, the induced 
pluripotent stem cells can differentiate into cells having neu- 
ronal structures and neuronal markers. It is anticipated that 
virtually any iPS cells or cell lines may be used with the 
present invention, including, e.g., those described in Yu and 
Thompson, 2008. 

[0092] In another method, human fetal or newborn fibro- 
blasts are transformed with four genes, Oct4, Sox2, Nanog 
and Lin28 using lentivirus transduction (Yu et al., 2007). At 
12-20 days post infection, colonies with human ES cell mor- 
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phology become visible. The colonies are picked and 
expanded. The induced pluripotent stem cells making up the 
colonies are morphologically similar to human ES cells, 
express various human ES cell markers, and form teratomas 
having neural tissue, cartilage and gut epithelium after injec- 
tion into mice. 


[0093] Methods of preparing induced pluripotent stem 
cells from mouse are also known (Takahashi and Yamanaka, 
2006). Induction of iPS cells typically require the expression 
of or exposure to at least one member from Sox family and at 
least one member from Oct family. Sox and Oct are thought to 
be central to the transcriptional regulatory hierarchy that 
specifies ES cell identity. For example, Sox may be Sox-1, 
Sox-2, Sox-3, Sox-15, or Sox-18; Oct may be Oct-4. Addi- 
tional factors may increase the reprogramming efficiency, 
like Nanog, Lin28, KIf4, or c-Myc; specific sets of repro- 
gramming factors may be a set comprising Sox-2, Oct-4, 
Nanog and, optionally, Lin-28; or comprising Sox-2, Oct4, 
KIf and, optionally, c-Myc. 

[0094] iPS cells, like ES cells, have characteristic antigens 
that can be identified or confirmed by immunohistochemistry 
or flow cytometry, using antibodies for SSEA-1, SSEA-3 and 
SSEA-4 (Developmental Studies Hybridoma Bank, National 
Institute of Child Health and Human Development, Bethesda 
Md.), and TRA-1-60 and TRA-1-81 (Andrews et al., 1987). 
Pluripotency of embryonic stem cells can be confirmed by 
injecting approximately 0.5-10x10° cells into the rear leg 
muscles of 8-12 week old male SCID mice. Teratomas 
develop that demonstrate at least one cell type of each of the 
three germ layers. 


[0095] In certain aspects of the present invention, iPS cells 
are made from reprogramming somatic cells using repro- 
gramming factors comprising an Oct family member and a 
Sox family member, such as Oct4 and Sox2 in combination 
with KIf or Nanog as described above. The somatic cell for 
reprogramming may be any somatic cell that can be induced 
to pluripotency, such as a fibroblast, a keratinocyte, a hemato- 
poietic cell, a mesenchymal cell, a liver cell, a stomach cell, or 
a Ocell. Ina certain aspect, T cells may also be used as source 
of somatic cells for reprogramming (see U.S. application Ser. 
No. 12/478,154, incorporated herein by reference) or RNA 
transfection (see U.S. application Ser. No. 12/735,060). 


[0096] Reprogramming factors may be expressed from 
exogenous expression cassettes comprised in one or more 
vectors, such as an integrating vector or an episomal vector. In 
a further aspect, reprogramming proteins could be introduced 
directly into somatic cells by protein transduction (see U.S. 
Application No. 61/172,079, incorporated herein by refer- 
ence). 


[0097] A particular type of cell source for use in certain 
aspects of the present invention is an iPS cell line made by 
episomal reprogramming, e.g., an EBV element-based sys- 
tem (see US Publication No. 2010/0003757, incorporated 
herein by reference; Yu et al., 2009). Episomal reprogram- 
ming results in iPS cells genetically identical to the cells of 
the patient who donated the cells which were reprogrammed, 
and no foreign genetic material will be integrated into the 
genome of the reprogrammed cells by this method. The epi- 
somal reprogramming method can be done under fully 
defined conditions and is reliable, efficient and well defined. 
iPS lines made by episomal reprogramming can be differen- 
tiated into any desired lineage and reproduce infinitely in 
culture. 
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[0098] 4. Embryonic Stem Cells Derived by Somatic Cell 
Nuclear Transfer 

[0099] In certain aspects, pluripotent stem cells can be pre- 
pared by means of somatic cell nuclear transfer, in which a 
donor nucleus is transferred into a spindle-free oocyte. Stem 
cells produced by nuclear transfer are genetically identical to 
the donor nuclei. In one method, donor fibroblast nuclei from 
skin fibroblasts of a rhesus macaque are introduced into the 
cytoplasm of spindle-free, mature metaphase II rhesus 
macaque oocytes by electrofusion (Byrne et al., 2007). The 
fused oocytes are activated by exposure to ionomycin, and 
then incubated until the blastocyst stage. The inner cell mass 
of selected blastocysts are then cultured to produce embry- 
onic stem cell lines. The embryonic stem cell lines show 
normal ES cell morphology, express various ES cell markers, 
and differentiate into multiple cell types both in vitro and in 
vivo. As used herein, the term *ES cells" refers to embryonic 
stem cells derived from embryos containing fertilized nuclei. 
ES cells are distinguished from embryonic stem cells pro- 
duced by nuclear transfer, which are referred to as *embry- 
onic stem cells derived by somatic cell nuclear transfer.” 
[0100] 5. Other Stem Cells 

[0101] Fetal stem cells are cells with self-renewal capabil- 
ity and pluripotent differentiation potential. They can be iso- 
lated and expanded from fetal cytotrophoblast cells (Euro- 
pean Patent EP0412700) and chorionic villi, amniotic fluid 
and the placenta (WO/2003/042405). These are hereby incor- 
porated by reference in their entirety. Cell surface markers of 
fetal stem cells include CD117/c-kit*, SSEA3*, SSEA4* and 
SSEAI-. 

[0102] Somatic stem cells have been identified in most 
organ tissues. The best characterized is the hematopoietic 
stem cell. This is a mesoderm-derived cell that has been 
purified based on cell surface markers and functional charac- 
teristics. The hematopoietic stem cell, isolated from bone 
marrow, blood, cord blood, fetal liver and yolk sac, is the 
progenitor cell that reinitiates hematopoiesis for the life of a 
recipient and generates multiple hematopoietic lineages (see 
US. Pat. Nos. 5,635,387; 5,460,964; 5,677,136; 5,750,397; 
5,759,793; 5,681,599; 5,716,827; Hill et al., 1996). These are 
hereby incorporated by reference in their entirety. When 
transplanted into lethally irradiated animals or humans, 
hematopoietic stem cells can repopulate the erythroid, neu- 
trophil-macrophage, megakaryocyte and lymphoid hemato- 
poietic cell pool. In vitro, hematopoietic stem cells can be 
induced to undergo at least some self-renewing cell divisions 
and can be induced to differentiate to the same lineages as is 
seen in vivo. Therefore, this cell fulfills the criteria of a stem 
cell. 

[0103] The next best characterized is the mesenchymal 
stem cells (MSC), originally derived from the embryonic 
mesoderm and isolated from adult bone marrow, can differ- 
entiate to form muscle, bone, cartilage, fat, marrow stroma, 
and tendon. During embryogenesis, the mesoderm develops 
into limb-bud mesoderm, tissue that generates bone, carti- 
lage, fat, skeletal muscle and possibly endothelium. Meso- 
derm also differentiates to visceral mesoderm, which can give 
rise to cardiac muscle, smooth muscle, or blood islands con- 
sisting of endothelium and hematopoietic progenitor cells. 
Primitive mesodermal or mesenchymal stem cells, therefore, 
could provide a source for a number of cell and tissue types. 
A number of mesenchymal stem cells have been isolated (see, 
for example, U.S. Pat. Nos. 5,486,359; 5,827,735; 5,811,094; 
5,736,396; U.S. Pat. Nos. 5,837,539; 5,837,670; 5,827,740; 
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Jaiswal et al., 1997; Cassiede et al., 1996; Johnstone et al., 
1998; Yoo et al., 1998; Gronthos, 1994; Makino et al., 1999). 
These are hereby incorporated by reference in their entirety. 
Of the many mesenchymal stem cells that have been 
described, all have demonstrated limited differentiation to 
form only those differentiated cells generally considered to be 
of mesenchymal origin. To date, the most multipotent mes- 
enchymal stem cell expresses the SH2* SH4* CD29* CD44* 
CD71* CD90* CD106* CD120a* CD124* CD14 CD34- 
CD4AS* phenotype. 

[0104] Other stem cells have been identified, including gas- 
trointestinal stem cells, epidermal stem cells, neural and 
hepatic stem cells, also termed oval cells (Potten, 1998; Watt, 
1997; Alison et al, 1998). 

[0105] In some embodiments, the stem cells useful for the 
method described herein include but not limited to embryonic 
stem cells, induced plurpotent stem cells, mesenchymal stem 
cells, bone-marrow derived stem cells, hematopoietic stem 
cells, chrondrocytes progenitor cells, epidermal stem cells, 
gastrointestinal stem cells, neural stem cells, hepatic stem 
cells adipose-derived mesenchymal stem cells, pancreatic 
progenitor cells, hair follicular stem cells, endothelial pro- 
genitor cells and smooth muscle progenitor cells. 

[0106] In some embodiments, the stem cells used for the 
method described herein is isolated from umbilical cord, pla- 
centa, amniotic fluid, chorion villi, blastocysts, bone marrow, 
adipose tissue, brain, peripheral blood, the gastrointestinal 
tract, cord blood, blood vessels, skeletal muscle, skin, liver 
and menstrual blood. Stem cells prepared in the menstrual 
blood are called endometrial regenerative cells (Medistem 
Inc.). 

[0107] One ordinary skilled artisan in the art can locate, 
isolate and expand such stem cells. The detailed procedures 
forthe isolation of human stem cells from various sources are 
described in Current Protocols in Stem Cell Biology (2007) 
and it is hereby incorporated by reference in its entirety. 
Alternatively, commercial kits and isolation systems can be 
used. For example, the BD FACSAria cell sorting system, BD 
IMag magnetic cell separation system, and BD IMag mouse 
hematopoietic progenitor cell enrichment set from BD Bio- 
sciences. Methods of isolating and culturing stem cells from 
various sources are also described in U.S. Pat. Nos. 5,486, 
359, 6,991,897, 7,015,037, 7,422,736, 7,410,798, 7,410,773, 
7,399,632 and these are hereby incorporated by reference in 
their entirety. 

[0108] C. Somatic Cells 

[0109] Incertain aspects ofthe invention, there may also be 
provided engineered somatic cell lines having the exogenous 
expression cassettes. The somatic cell lines may be used in 
methods of transdifferentiation, i.e., the direct conversion of 
one somatic cell type into another, e.g., deriving hepatocytes 
from other somatic cells. 

[0110] However, the human somatic cells may be limited in 
supply, especially those from living donors. In certain aspects 
to provide a unlimited supply of starting cells, somatic cells 
may be immortalized by introduction of immortalizing genes 
or proteins, such as hTERT or oncogenes. The immortaliza- 
tion of cells may be reversible (e.g., using removable expres- 
sion cassettes) or inducible (e.g., using inducible promoters). 
[0111] Somatic cells in certain aspects of the invention may 
be primary cells (non-immortalized cells), such as those 
freshly isolated from an animal, or may be derived from a cell 
line (immortalized cells). The cells may be maintained in cell 
culture following their isolation from a subject. In certain 
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embodiments the cells are passaged once or more than once 
(e.g., between 2-5, 5-10, 10-20, 20-50, 50-100 times, or 
more) prior to their use in a method ofthe invention. In some 
embodiments the cells will have been passaged no more than 
1, 2, 5, 10, 20, or 50 times priorto their use in a method ofthe 
invention. They may be frozen, thawed, etc. 


[0112] The somatic cells used or described herein may be 
native somatic cells, or engineered somatic cells, i.e., somatic 
cells which have been genetically altered. Somatic cells ofthe 
present invention are typically mammalian cells, such as, for 
example, human cells, primate cells or mouse cells. They may 
be obtained by well-known methods and can be obtained 
from any organ or tissue containing live somatic cells, e.g., 
blood, bone marrow, skin, lung, pancreas, liver, stomach, 
intestine, heart, reproductive organs, bladder, kidney, urethra 
and other urinary organs, etc. 


[0113] Mammalian somatic cells useful in the present 
invention include, but are not limited to, Sertoli cells, endot- 
helial cells, granulosa epithelial, neurons, pancreatic islet 
cells, epidermal cells, epithelial cells, hepatocytes, hair fol- 
licle cells, keratinocytes, hematopoietic cells, melanocytes, 
chondrocytes, lymphocytes (B and T lymphocytes), erythro- 
cytes, macrophages, monocytes, mononuclear cells, cardiac 
muscle cells, and other muscle cells, etc. 


[0114] In some embodiments cells are selected based on 
their expression of an endogenous marker known to be 
expressed only or primarily in a desired cell type or expres- 
sion of an expression cassette under the control of a condi- 
tion-responsive regulatory element. For example, vimentin is 
a fibroblast marker. Other useful markers include various 
keratins, cell adhesion molecules such as cadherins, fibronec- 
tin, CD molecules, etc. The population of somatic cells may 
have an average cell cycle time of between 18 and 96 hours, 
e.g., between 24-48 hours, between 48-72 hours, etc. In some 
embodiments, at least 9096, 9596, 9896, 9996, or more of the 
cells would be expected to divide within a predetermined time 
such as 24, 48, 72, or 96 hours. 


[0115] Methods described herein may be used to program 
one or more somatic cells, e.g., colonies or populations of 
somatic cells into hepatocytes. In some embodiments a popu- 
lation of cells ofthe present invention is substantially uniform 
in that at least 9096 of the cells display a phenotype or char- 
acteristic of interest. In some embodiments at least 9596, 
96%, 97%, 98%, 99%, 99.5%, 99.8%, 99.9, 99.95% or more 
of the cells display a phenotype or characteristic of interest. In 
certain embodiments of the invention the somatic cells have 
the capacity to divide, i.e., the somatic cells are not post- 
mitotic. 


[0116] Somatic cells may be partially or completely differ- 
entiated. Differentiation is the process by which a less spe- 
cialized cell becomes a more specialized cell type. Cell dif- 
ferentiation can involve changes in the size, shape, polarity, 
metabolic activity, gene expression and/or responsiveness to 
signals of the cell. For example, hematopoietic stem cells 
differentiate to give rise to all the blood cell types including 
myeloid (monocytes and macrophages, neutrophils, baso- 
phils, eosinophils, erythrocytes, megakaryocytes/platelets, 
dendritic cells) and lymphoid lineages (T-cells, B-cells, NK- 
cells). During progression along the path of differentiation, 
the ultimate fate of a cell becomes more fixed. As described 
herein, both partially differentiated somatic cells and fully 
differentiated somatic cells can be programmed as described 
herein to produce desired cell types such as hepatocytes. 
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IV. CONDITION-RESPONSIVE REGULATORY 
ELEMENTS 


[0117] Certain aspects of the invention provide methods 
and compositions for determination of biological response 
and/or pharmacologic effects on target tissue types in cell 
populations cultured in vitro. The cells contain a variety of 
expression cassettes comprising different condition-respon- 
sive regulatory elements controlling the expression of a 
selectable or screenable marker that reflects a status change in 
the cell, like a change in differentiation status, or a toxicologic 
or metabolic change, such as may be caused by a drug can- 
didate that is present in the culture medium. The condition- 
responsive regulatory elements may be taken from a gene 
known to be upregulated when a tissue-specific, cell-specific, 
differentiation-specific, or molecular-pathway-specific 
response is activated ora particular toxicologic or other meta- 
bolic effect takes place in the cell. It controls transcription of 
a marker gene that provides an external signal that can be 
monitored as an indication of regulatory element activity. 
This system enables rapid high-throughput screening of a 
panel of culturing conditions for directed differentiation or a 
panel of test agents for potential toxicity and other metabolic 
effects on the cell. 

[0118] “A condition-responsive regulatory element,’ as 
used herein, refers to a nucleotide sequence that regulates 
(e.g., up-regulates or down-regulates) transcription in 
response to a specific cellular condition, for example, a con- 
dition involving programming to a specific cell type or acti- 
vation of a cell signaling pathway. For example, these 
sequences may be modular in nature, consisting of arrays of 
short (10- to 12-base pair) recognition elements that interact 
with specific transcription factors. Positive and negative regu- 
latory elements that function only in specific cell types or in 
response to extracellular inducers have been identified and 
could be predicted by bioinfomratic analysis. A number of 
cases of inducible and tissue-specific gene expression involve 
the activation of preexisting transcription factors, rather than 
the synthesis of new proteins. This activation may involve 
covalent modification of the protein or an allosteric change in 
its structure. 

[0119] B. Condition-Responsive Regulatory Elements 
[0120] The exogenous expression cassettes in the set of cell 
lines may include any condition-responsive regulatory ele- 
ments, especially promoters or enhancers specific for a 
selected tissue or cell lineage, or a selected signaling pathway, 
such as promoters of genes listed in Table 1. As indicated in 
the Table below, a tissue or lineage specific promoter can be 
specific to an organ, a generic cell type or a specific cell type. 
Thus a promoter, like Troponin T or alpha Myosin Heavy 
Chain, can be pan-cardiac, or expressing in all types of heart 
cells, while a promoter like sarcolipin can be used to specify 
atrial cells alone. 


TABLE 1 


Examples of promoters for cell type-specific genes 


Heart troponin T promoter (known to be pan-cardiac) 
My12V promoter (known to be ventricular specific) 
Sarcolipin promoter (reported to be atrial specific) 
Liver Alpha-1-antitrypsin (AAT) promoter (endoderm) 
Cyp3A4 promoter (hepatocyte) 
HNF4a/FOXA2/HNF6 promoter (hepatocyte) 
HNF 1b promoter (cholangiocytes) 


Pancreas PDX1 promoter (beta cells) 
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TABLE 1-continued 


Examples of promoters for cell type-specific genes 
Intestine IAP (Intestinal alkaline phosphatase) promoter (small intestine) 
Kruppel-like factor 4 (KLF4) promoter (large intestine) 
Lung Surfactant protein C (SP-C) promoter (lung epithelial type 2 
cells 
Sur: 2 ant protein B (SP-B) promoter (alveolar cells) 
Clara cell 10-Kd Protein promoter (airway Clara cells) 
Endothelial VE (vascular endothelial)-cadherin promoter (pan-endothelial) 
Epithelial Epithelial cell adhesion molecule promoter (pan-epithelial) 
Blood Vav promoter (early hematopoietic) 
Glycophorin A promoter (myeloid cells) 
Alpha-globin promoter (mature erythroid cells) 
Neuron BIII tubulin promoter (pan-neuron) 
Tyrosine hydroxylase (TH) promoter (dopaminergic) 
Glutamic Acid Decarboxylase (GAD67) promoter (GABAergic) 
Vesicular glutamate transporters (VGLUT1 or VGLUT2) 
promoter (glutamatergic) 
Glial fibrillary acidic protein (GFAP) promoter (astrocytes) 
O1 or O4 promoter (oligodendrocytes) 
Adrenal 24-dehydrocholesterol reductase promoter (cortical cell) 
gland 
Prostate Forkhead box A1 promoter (glandular cells) 
Bladder ^ Uroplakin 3A promoter (urothelial cells) 
Taste buds Gustducin promoter (taste sensory cells) 
Oral Angiomotin like 2 promoter (squamous epithelial cells) 
mucosa 
Tonsil ST6 beta-galactosamide alpha-2,6-sialyltranferase 1 
promoter (reaction center cells) 
Kidney Integrin alpha 8 promoter (glomeruli cells) 
Solute carrier family 12 (sodium/potassium/chloride 
transporters), member 1 promoter (the cells of the thick 
ascending limb of the loop of Henle in nephrons) Pollocalyxin- 
like promoter (podocytes in the Bowman's capsule) 
Testes Hydroxy-delta-5-steroid dehydrogenase, 3 beta- and steroid 
delta-isomerase 1 promoter (Leydig cells) 
Salivary — Lactoperoxidase promoter (glandular cells) 
gland 
Tooth Ameloblastin (enamal matrix protein) promoter (ameloblast) 
Endocrine Enolase 2 (gamma, neuronal) promoter (APUD cells in 
system cerebral cortex, hippocampus, lateral ventrical, and cerebellum) 
[0121] Exemplary tissue-specific regulatory elements may 


include, but are not limited to one or more, even all of regu- 
latory elements of genes specific for: 

[0122] 1) ORGANS: All organs in the human body. E.g., 
kidney, heart, liver, pancreas, intestines etc. 

[0123] 2) ORGAN SUB-FRACTIONS: All organ subfrac- 
tions in the human body. E.g., Kidney medulla, kidney cortex, 
heart atria, heart ventricle, etc. 

[0124] Example ofcell-type specific categories include, but 
are not limited to: 

[0125] 1) CELL PROGENITORS: All relevant progenitor 
cell subtypes. E.g., neural progenitors, hematopoietic pro- 
genitors, hepatocyte progenitors, cardiac progenitors, etc. 
[0126] 2) TERMINAL CELL TYPES: All terminal cell 
types in the human body. E.g., Hepatocytes, cardiomyocytes, 
endothelial cells, neurons, etc. 

[0127] 3) TERMIMINAL CELL SUBTYPES: all terminal 
cell subtypes in the human body. E.g., Ventricular cardiomyo- 
cytes, atrial cardiomyocytes, nodal cardiomyocytes, arterial 
endothelial cells, venous endothelial cells, lymphatic endot- 
helial cells, blood-brain barrier endothelial cells, dopaminer- 
gic neurons, cholinergic neurons, gabaergic neurons, motor 
neurons, etc. 

[0128] Promoters or coding sequences for genes involved 
in various signaling pathways may be used in certain aspects 
of the present invention. Specifically, any promoter or tran- 
scription control element controlling a gene that is up- or 
down-regulated in response to a change in culture or cellular 
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conditions (particularly the presence of a class of test drugs as 
well as up-regulation or down-regulation of a signaling path- 
way) may be suitable for use in certain aspects of this inven- 
tion. Those signaling pathway genes are known in the art, for 
example, available via world wide web at invitrogen.com/ 
site/us/en/home/Products-and-Services/Applications/Cell- 
and-Tissue-Analysis/Signaling-Pathways.html. 

[0129] Exemplary signaling pathway genes may be 
involved in intracellular signaling pathways include, but are 
not limited to: tyrosine kinases, heterotrimeric G proteins, 
small GTPases, serine/threonine protein kinases, phos- 
phatases, lipid kinases, hydrolases, chromatin regulation, 
MAPK signaling, Apoptosis/Autophagy, Translational Con- 
trol, Cell Cycle/Checkpoint, DNA Damage, Jak/Stat Path- 
way, NF-KB Signaling, TGF-p/Smad signaling, lymphocyte 
signaling, angiogenesis, vesicle trafficking, cytoskeletal sig- 
naling, adhesion, glucose metabolism, Wnt/Hedgehog/ 
Notch, stem cell/lineage markers, nuclear receptor, or protein 
folding and stability. Second messengers including: cyclic 
AMP (cAMP), cyclic GMP (cGMP), Phosphatidylinositol- 
triphosphate (PIP3), Diacylglycerol (DAG), Inositol-triphos- 
phate (IP3). Adapter proteins including: EF hand domains of 
calmodulin, Pleckstrin homology domains of the kinase pro- 
tein AKT or the like. 

[0130] Examples of promoters having suitable characteris- 
tics also include the following: 

[0131] Promoters for genes that respond to apoptosis, such 
as the PUMA gene. Drugs that trigger apoptosis may trigger 
promoters in this category. Other candidates are Gadd34, 
PUMA, GAHSP40, TRAIL-R2/DRS, c-fos, Gadd153, 
APAF-1, Gadd45, BTG2/PC3, Peg3/Pw1, Siahla, S29 ribo- 
somal protein, FasL/CD95L, tissue transglutaminase, 
GRP78, Nur77/NGFI-B, Cyclophilin D/CYPD, and P73. 
[0132] Promoters for genes that respond to DNA damage, 
such as the p21, p21/WAF1, or Pig3 gene. Mutagens or ter- 
atogens may trigger promoters in this category. 

[0133] Promoters for genes that respond to hyperplasia, 
such as the Ki-67 or Aurora A gene. Drugs that stimulate 
proliferation may trigger promoters in this category. 

[0134] Promoters for genes that respond to oxidative stress. 
Heme oxygenase 1 (Hmox1), and superoxide dismutase (Mn- 
SOD) are upregulated with low oxygen levels; y-glutamyl 
cysteinyl ligase (GCL), and Metallothionine I and II are 
upregulated by depletion of glutathione, or the presence of 
metal ions, respectively. Other candidates are IkB, ATF4, 
xanthine oxidase, COX2, iNOS, Ets-2, Cyclophilin A/CYPA, 
NQOI, and bNIP3. 

[0135] Promoters for transcription factors that reflect 
changes in gene expression profiles upon initiation of any of 
these events, such as the PXR, CAR, aryl hydrocarbon recep- 
tor (AhR), or Nrf2 gene 

[0136] Promoters for other hepatocyte markers that are 
upregulated in liver toxicity, such as Lrg-21, SOCS-2, SOCS- 
3, PAI-I, GBP28/adiponectin, al-acid glycoprotein, ATF3, 
and Igfbp-3. 

[0137] Promoters for genes that are responsive to receptors 
that act in the nucleus, exemplified by androgen, estrogen, 
and pPAG responsive gene. Àn example is the gene for pros- 
tate specific antigen (PSA). 

[0138] Promoters for hepatocyte enzymes involved in drug 
metabolism that are also upregulated in the presence of sub- 
strate. Exemplary are cytochrome P450 genes, such as 
CYP3A4 and CYP1A1. 
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[0139] Promoter for drug transporter genes also upregu- 
lated by substrate, such as MDRI. 

[0140] Promoters for genes that affect the contraction rate 
or the QT interval of the heart, such as calcium flux genes. 
[0141] Promoters for genes controlling a product that is 
deficient in certain clinical conditions, and for which it may 
be useful to screen drugs that can regulate expression. Exem- 
plary are genes that control hormone expression (e.g., insulin, 
orcortisol), and genes that control synthesis, release, metabo- 
lism, or reuptake of neurotransmitters (e.g., the serotonin 
transporter and tyrosine hydroxylase). 

[0142] These and other promoters referred to in this disclo- 
sure can be cloned by amplification from a suitable genomic 
library using primers specific for the desired sequence, con- 
structed using sequence data from such sources as GenBank. 
[0143] In a particular example, tissue-specific transgene 
expression, especially for marker gene expression in hepato- 
cytes derived from programming, is desirable as a way to 
identify derived hepatocytes. To increase both specificity and 
activity, the use of cis-acting regulatory elements has been 
contemplated. For example, a hepatocyte-specific promoter 
may be used, such as a promoter of albumin, a--1-antitrypsin 
(AAT), cytochrome p450 3A4 (CYP3A4), apolipoprotein 
A-I, or APOE. 

[0144] In certain aspects, this also concerns enhancer 
sequences, i.e., nucleic acid sequences that increase a pro- 
moter’s activity and that have the potential to act in cis, and 
regardless of their orientation, even over relatively long dis- 
tances (up to several kilobases away from the target pro- 
moter). However, enhancer function is not necessarily 
restricted to such long distances as they may also function in 
close proximity to a given promoter. For the liver, numerous 
approaches to incorporate such organ-specific regulatory 
sequences into retroviral, lentiviral, adenoviral and adeno- 
associated viral vectors or non-viral vectors (often in addition 
to house-keeping hepatocyte-specific cellular promoters) 
have been reported so far (Ferry et al., 1998; Ghosh et al., 
2000; Miao et al., 2000; Follenzi et al., 2002). 

[0145] Several enhancer sequences for liver-specific genes 
have been documented. WO2009130208 describes several 
liver-specific regulatory enhancer sequences. WO95/011308 
describes a gene therapy vector comprising a hepatocyte- 
specific control region (HCR) enhancer linked to a promoter 
and a transgene. The human apolipoprotein E-Hepatocyte 
Control Region (ApoE-HCR) is a locus control region (LCR) 
for liver-specific expression of the apolipoprotein E (ApoE) 
gene. The ApoE-HCR is located in the ApoE/CI/CII locus, 
has a total length of 771 bp and is important in expression of 
the genes ApoE and ApoC-1 in the liver (Simonet et al., 
1993). In WO01/098482, the combination of this specific 
ApoE enhancer sequence or a truncated version thereof with 
hepatic promoters is suggested. It was shown that vector 
constructs combining the (non-truncated) ApoE-HCR 
enhancer with a human alpha-antitrypsin (AAT) promoter 
were able to produce the highest level of therapeutic protein 
in vivo (Miao et al., 2000) and may confer sustained expres- 
sion when used in conjunction with a heterologous transgene 
(Miao et al., 2001). 

[0146] Other chimeric liver-specific constructs have also 
been proposed in the literature, e.g., with the AAT promoter 
and the albumin or hepatitis B enhancers (Kramer et al., 
2003), or the alcohol dehydrogenase 6 (ADH6) basal pro- 
moter linked to two tandem copies of the apolipoprotein E 
enhancer element (Gehrke et al., 2003). The authors of the 
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latter publication stress the importance of the relatively small 
size (1068 bp) of this enhancer-promoter combination. 
[0147] C. Promoter Identification and Characterization 
[0148] Currently a collection of over 17,000 human pro- 
moters (available from SwitchGear Genomics) could be inte- 
grated into exogenous expression cassettes of the present 
invention. 

[0149] Regulation is the complex orchestration of events 
starting with an extracellular signal such as a hormone and 
leading to an increase or decrease in the activity of one or 
more proteins. Bioinformatics techniques could be applied to 
explore various steps in this process. For example, promoter 
analysis involves the identification and study of sequence 
motifs in the DNA surrounding the coding region of a gene. 
These motifs influence the extent to which that region is 
transcribed into mRNA. 

[0150] Regulation of expression is determined to a large 
extent by the promoter sequences of the individual genes 
(and/or enhancers). The complete sequence of the human 
genome now provides the molecular basis for the identifica- 
tion of many regulatory regions. For example, promoter 
sequences for specific cDNAs can be obtained reliably from 
genomic sequences by exon mapping. In the many cases in 
which cDNAs are 5'-incomplete, high quality promoter pre- 
diction tools can be used to locate promoters directly in the 
genomic sequence. 

[0151] Significant improvements in promoter prediction 
have been made within the last few years. PromoterScan 
(Prestridge, 1995) has been viewed as one of the first pro- 
moter prediction algorithms with acceptably high specificity. 
Recently, PromoterInspector (Scherf et al., 2000) and Dragon 
Promoter Finder (Bajic et al., 2002) made further progress in 
specificity and sensitivity of promoter prediction algorithms. 
PromoterScan identifies promoters using a TATA box posi- 
tional weight matrix combined with the density of specific 
transcription factor binding sites. The algorithm has been 
demonstrated to be of comparatively high specificity but low 
sensitivity. 

[0152] An effective promoter identification algorithm, 
which is called PromoterExplorer, has been proposed 
recently by Xie et al. (2006). In this approach, various fea- 
tures such as local distribution of pentamers, positional CpG 
island features and digitized DNA sequence are combined to 
build a high-dimensional input vector and then a cascade 
AbaBoost algorithm is used both to perform feature selection 
and classifier training. 

[0153] Expression data can also be used to infer gene regu- 
lation: one might compare microarray data from a wide vari- 
ety of states of an organism to form hypotheses about the 
genes involved in each state. In a single-cell organism, one 
might compare stages of the cell cycle, along with various 
stress conditions (heat shock, starvation, etc.). One can then 
apply clustering algorithms to that expression data to deter- 
mine which genes are co-expressed. For example, the 
upstream regions (promoters) of co-expressed genes can be 
searched for over-represented regulatory elements. 


V. EXPRESSION CASSETTES 


[0154] The present invention involve use of exogenous 
expression cassettes including a condition-responsive regu- 
latory element that regulates the expression of a selectable or 
screenable marker that provides an external signal for moni- 
toring the regulatory element activity. In certain aspects, the 
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expression cassettes can convey a polycistronic message for 
efficient co-expression of multiple genes. 

[0155] B. Polycistronic Message 

[0156] In certain aspects of the present invention, the flex- 
ibility and efficient expression from this polycistronic system 
underlie its advantages and establish it as a useful tool to 
provide engineered cells. The various permutations of this 
system include but are not limited to: 1) including at least two 
markers in a polycistronic transcript, such as both a selectable 
and a screenable marker, two different selectable markers or 
two different screenable markers, 2) including one or more 
markers in combination with a non-marker coding sequence 
such as a drug metabolism enzyme gene or programming 
gene, or 3) creating a cassette with at least three coding 
sequences, for example, using at least two IRES sites or 2A 
peptides. 

[0157] 2. Protease Cleavage Site or Self-Cleaving Peptide 
for Polycistronic Expression 

[0158] In certain aspects, according to the present inven- 
tion, the genes encoding markers or other proteins may be 
connected to one another by a sequence (there may be more 
than one) coding for a protease cleavage site (i.e., a sequence 
comprising the recognition site of a protease) or at least one 
self-cleaving peptide. 

[0159] According to a preferred embodiment of the present 
invention the protease(s) capable of cleaving the cleavage 
sites encoded by the sequence(s) connecting the genes con- 
stituting the polycistronic message is/are encoded by the 
polynucleotide of the present invention. More preferably, the 
gene(s) encoding the protease(s) is/are part of at least one of 
the polycistronic meassage. 

[0160] Suitable protease cleavages sites and self-cleaving 
peptides are known to the skilled person (see, e.g., in Ryan et 
al, 1997; Scymezak et al., 2004). Preferred examples of 
protease cleavage sites are the cleavage sites of potyvirus NIa 
proteases (e.g., tobacco etch virus protease), potyvirus HC 
proteases, potyvirus P1 (P35) proteases, byovirus Nla pro- 
teases, byovirus RNA-2-encoded proteases, aphthovirus L 
proteases, enterovirus 2A proteases, rhinovirus 2A proteases, 
picorna 3C proteases, comovirus 24K proteases, nepovirus 
24K proteases, RTSV (rice tungro spherical virus) 3Ciike 
protease, PY\IF (parsnip yellow fleck virus) 3C-like protease, 
thrombin, factor Xa and enterokinase. 

[0161] Due to its high cleavage stringency, TEV (tobacco 
etch virus) protease cleavage sites are particularly preferred. 
Thus, the genes of the polygenes according to the present 
invention are preferably connected by a stretch of nucleotides 
comprising a nucleotide sequence encoding an amino acid 
sequence of the general, form E)(XY XQ(G/S) wherein X 
represents any amino acid (cleavage by TEV occurs between 
Q and G or Q. and S). Most preferred are linker nucleotide 
sequences coding for ENLYFQG and ENLYFQS, respec- 
tively. 

[0162] Preferred self-cleaving peptides (also called “cis- 
acting hydrolytic elements", CHYSEL; see deFelipe (2002)) 
are derived from potyvirus and cardiovirus 2A peptides. 
Especially preferred self-cleaving peptides are selected from 
2A peptides derived from FMDV (foot-and-mouth disease 
virus), equine rhinitis A virus, Thosea asigna virus and por- 
cine teschovirus. 

[0163] The polypeptides encoded by the nucleotide 
sequences constituting the polycistronic meassage of the 
present invention may be the same or different. Thus, each 
polygene present in the constructs of the invention may con- 
tain one or more copy of each nucleotide sequence encoding 
a protein of interest. 
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[0164] 3.IRES 

[0165] In certain embodiments of the invention, the use of 
internal ribosome entry sites (IRES) elements are used to 
create multigene, or polycistronic, messages. IRES elements 
are able to bypass the ribosome scanning model of 5' methy- 
lated Cap dependent translation and begin translation at inter- 
nal sites (Pelletier and Sonenberg, 1988). IRES elements 
from two members of the picornavirus family (polio and 
encephalomyocarditis) have been described (Pelletier and 
Sonenberg, 1988), as well an IRES from a mammalian mes- 
sage (Macejak and Sarnow, 1991). IRES elements can be 
linked to heterologous open reading frames. Multiple open 
reading frames can be transcribed together, each separated by 
an IRES, creating polycistronic messages. By virtue of the 
IRES element, each open reading frame is accessible to ribo- 
somes for efficient translation. Multiple genes can be effi- 
ciently expressed using a single promoter/enhancer to tran- 
scribe a single message (see U.S. Pat. Nos. 5,925,565 and 
5,935,819, each herein incorporated by reference). 

[0166] Most eukaryotic and viral messages initiate transla- 
tion by a mechanism involving recognition of a 7-methylgua- 
nosine cap at the 5' end of the mRNA. Ina few cases, however, 
translation occurs via a cap-independent mechanism in which 
an internal ribosome entry site (IRES) positioned 3' down- 
stream of the gene translated from the cap region of the 
mRNA is recognized by the ribosome, allowing translation of 
a second coding region from the transcript. Therefore, IRES 
elements are able to bypass the ribosome scanning model of 
5' methylated Cap dependent translation and begin translation 
at internal sites (Pelletier and Sonenberg, 1988). 

[0167] This is particularly important in the present inven- 
tion as an IRES sequence allows simultaneous expression of 
multiple proteins from a single genetic locus. IRES elements 
can be linked to heterologous open reading frames. Multiple 
open reading frames can be transcribed together, each sepa- 
rated by an IRES, creating polycistronic messages. By virtue 
of the IRES element, each open reading frame is accessible to 
ribosomes for efficient translation. Multiple genes can be 
efficiently expressed using a single promoter/enhancer to 
transcribe a single message (see U.S. Pat. Nos. 5,925,565 and 
5,935,819, each herein incorporated by reference). 

[0168] A particularly preferred embodiment involves 
including coding sequences for both a desired recombinant 
product and a selectable or screenable marker within the same 
polycistronic transcript. Successful transformation events are 
marked by both expression of the desired reprogramming 
factors or drug-responsive genes and the easily detectable 
selectable or screenable markers, facilitating selection of suc- 
cessfully transfected cells. 

[0169] IRES elements from two members of the picornavi- 
rus family (polio and encephalomyocarditis) have been 
described (Pelletier and Sonenberg, 1988), as well an IRES 
from a mammalian message (Macejak and Sarnow, 1991). 
Certain examples include those IRES elements from poliovi- 
rus Type I, the 5'UTR of encephalomyocarditis virus (EMV), 
of “Thelier’s murine encephalomyelitis virus (TMEV), of 
“foot and mouth disease virus” (FMDV), of “bovine enterovi- 
rus (BEV), of “coxsackie B virus" (CBV), or of “human 
rhinovirus” (HRV), or the “human immunoglobulin heavy 
chain binding protein” (BIP) 5'UTR, the Drosophila anten- 
napediae 5'UTR or the Drosophila ultrabithorax S'UTR, or 
genetic hybrids or fragments from the above-listed 
sequences. IRES sequences are described in Kim et al. (1992) 
and McBratney et al. (1993). 

[0170] In certain embodiments, a polycistronic transcript 
may be used by employing one or more internal ribosome 
entry sites (IRESs). Exemplary IRES may be an encepha- 
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lomyocarditis virus IRES, a picornavirus IRES, a foot-and- 
mouth disease virus IRES, a hepatitis A virus IRES, a hepa- 
titis C virus IRES, a human rhinovirus IRES, a poliovirus 
IRES, a swine vesicular disease virus IRES, a turnip mosaic 
potyvirus IRES, a human fibroblast growth factor 2 mRNA 
IRES, a pestivirus IRES, a Leishmania RNA virus IRES, a 
Moloney murine leukemia virus IRES a human rhinovirus 14 
IRES, an aphthovirus IRES, a human immunoglobulin heavy 
chain binding protein mRNA IRES, a Drosophila Antenna- 
pedia mRNA IRES, a human fibroblast growth factor 2 
mRNA IRES, a hepatitis G virus IRES, a tobamovirus IRES, 
a vascular endothelial growth factor mRNA IRES, a Cox- 
sackie B group virus IRES, a c-myc protooncogene mRNA 
IRES, a human MYT2 mRNA IRES, a human parechovirus 
type 1 virus IRES, a human parechovirus type 2 virus IRES, 
a eukaryotic initiation factor 4GI mRNA IRES, a P/autia stali 
intestine virus IRES, a Theiler's murine encephalomyelitis 
virus IRES, a bovine enterovirus IRES, a connexin 43 mRNA 
IRES, a homeodomain protein Gtx mRNA IRES, an AML1 
transcription factor mRNA IRES, an NF-kappa B repressing 
factor mRNA IRES, an X-linked inhibitor of apoptosis 
mRNA IRES, a cricket paralysis virus RNA IRES, a p58 
(PITSLRE) protein kinase mRNA IRES, an ornithine decar- 
boxylase mRNA IRES, a connexin-32 mRNA IRES, a bovine 
viral diarrhea virus IRES, an insulin-like growth factor I 
receptor mRNA IRES, a human immunodeficiency virus type 
1 gag gene IRES, a classical swine fever virus IRES, a Kapo- 
si's sarcoma-associated herpes virus IRES, a short IRES 
selected from a library of random oligonucleotides, a Jem- 
brana disease virus IRES, an apoptotic protease-activating 
factor 1 mRNA IRES, a Rhopalosiphum padi virus IRES, a 
cationic amino acid transporter mRNA IRES, a human insu- 
lin-like growth factor II leader 2 mRNA IRES, a giardiavirus 
IRES, a Smad5 mRNA IRES, a porcine teschovirus-1 talfan 
IRES, a Drosophila Hairless mRNA IRES, an hSNMI 
mRNA IRES, a Cbfal/Runx2 mRNA IRES, an Epstein-Barr 
virus IRES, a hibiscus chlorotic ringspot virus IRES, a rat 
pituitary vasopressin V1b receptor mRNA IRES, a human 
hsp70 mRNA IRES, or a variant thereof. 

[0171] C. Selection and Screenable Markers 

[0172] In certain embodiments of the invention, cells con- 
taining a nucleic acid construct of the present invention may 
be identified in vitro or in vivo by including a marker in the 
expression cassette. Such markers would confer an identifi- 
able change to the cell permitting easy identification of cells 
containing the expression cassette. Generally, a selection 
marker is one that confers a property that allows for selection. 
A positive selection marker is one in which the presence of the 
marker allows for its selection, while a negative selection 
marker is one in which its presence prevents its selection. An 
example of a positive selection marker is a drug resistance 
marker. 

[0173] | Usually the inclusion ofa drug selection marker aids 
in the cloning and identification of transformants, for 
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example, genes that confer resistance to neomycin, puromy- 
cin, blastocidin, geneticin, hygromycin, DHFR, GPT, zeocin 
and histidinol are useful selection markers. In addition to 
markers conferring a phenotype that allows for the discrimi- 
nation of transformants based on the implementation of con- 
ditions, other types of markers including screenable markers 
such as GFP, whose basis is colorimetric analysis, are also 
contemplated. Alternatively, screenable enzymes as negative 
selection markers such as herpes simplex virus thymidine 
kinase (tk) or chloramphenicol acetyltransferase (CAT) may 
be utilized. One of skill in the art would also know how to 
employ immunologic markers, possibly in conjunction with 
FACS analysis. The marker used is not believed to be impor- 
tant, so long as it is capable of being expressed simulta- 
neously with the nucleic acid encoding a gene product. Fur- 
ther examples of selection and screenable markers are well 
known to one of skill in the art. 

[0174] Certain embodiments of the present invention uti- 
lize screenable reporter genes to indicate specific property of 
cells, for example, differentiation along a defined cell lineage 
by activating a condition-responsive regulatory element 
which controls the reporter marker gene expression. 

[0175] Examples of such reporters include genes encoding 
cell surface proteins (e.g., CD4, HA epitope), fluorescent 
proteins, antigenic determinants and enzymes (e.g., B-galac- 
tosidase or a nitroreductase). The vector containing cells may 
be isolated, e.g., by FACS using fluorescently-tagged anti- 
bodies to the cell surface protein or substrates that can be 
converted to fluorescent products by a vector encoded 
enzyme. In certain aspects cell-permeable dyes can be used to 
identify cells expressing a resporter. For example, expression 
ofa NFAT nitroreductase gene can be detected by using a cell 
permeable pro-fluorogenic substrate such as CytoCyS5S (see, 
e.g., U.S. Pat. Nos. 5,633,158, 5,780,585, 5,977,065 and EP 
Patent No. EP 1252520, each incorporate herein by refer- 
ence). 

[0176] Inspecific embodiments, the reporter gene is a fluo- 
rescent protein. A broad range of fluorescent protein genetic 
variants have been developed that feature fluorescence emis- 
sion spectral profiles spanning almost the entire visible light 
spectrum (see Table 2 for non-limiting examples). Mutagen- 
esis efforts in the original Aequorea victoria jellyfish green 
fluorescent protein have resulted in new fluorescent probes 
that range in color from blue to yellow, and are some of the 
most widely used in vivo reporter molecules in biological 
research. Longer wavelength fluorescent proteins, emitting in 
the orange and red spectral regions, have been developed 
from the marine anemone, Discosoma striata, and reef corals 
belonging to the class Anthozoa. Still other species have been 
mined to produce similar proteins having cyan, green, yellow, 
orange, and deep red fluorescence emission. Developmental 
research efforts are ongoing to improve the brightness and 
stability of fluorescent proteins, thus improving their overall 
usefulness. 


TABLE 2 


Fluorescent Protein Properties 


Protein 
(Acronym) 


GFP (wt) 


EGFP 
AcGFP 


Relative 
Excitation Emission Molar Brightness 
Maximum Maximum Extinction Quantum in vivo (96 of 
(nm) (nm) Coefficient Yield Structure EGFP) 
395/475 509 21,000 0.77  Monomer* 48 
Green Fluorescent Proteins 
484 507 56,000 0.60 | Monomer* 100 
480 505 50,000 0.55 Monomer* 82 
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Fluorescent Protein Properties 


Excitation Emission Molar 


Protein Maximum Maximum Extinction Quantum in vivo 
(Acronym) (nm) (nm) Coefficient Yield Structure 
TurboGFP 482 502 70,000 0.53 Monomer* 
Emerald 487 509 57,500 0.68 Monomer* 
Azami 492 505 55,000 0.74 Monomer 
Green 
ZsGreen 493 505 43,000 0.91 Tetramer 
Blue Fluorescent Proteins 
EBFP 383 445 29,000 0.31 Monomer* 
Sapphire 399 511 29,000 0.64 Monomer* 
T-Sapphire 399 511 44,000 0.60 Monomer* 
Cyan Fluorescent Proteins 
ECFP 439 476 32,500 0.40 Monomer* 
mCFP 433 475 32,500 0.40 Monomer 
Cerulean 433 475 43,000 0.62 Monomer* 
CyPet 435 477 35,000 0.51 Monomer* 
AmCyanl 458 489 44,000 0.24 Tetramer 
Midori-Ishi 472 495 27,300 0.90 Dimer 
Cyan 
mTFP1 462 492 64,000 0.85 Monomer 
(Teal) 
Yellow Fluorescent Proteins 
EYFP 514 527 83,400 0.61 Monomer* 
Topaz 514 527 94,500 0.60 Monomer* 
Venus 515 528 92,200 0.57 Monomer* 
mCitrine 516 529 77,000 0.76 Monomer 
YPet 517 530 104,000 0.77 Monomer* 
PhiYFP 525 537 124,000 0.39 Monomer* 
ZsYellowl 529 539 20,200 0.42 Tetramer 
mBanana 540 553 6,000 0.7 Monomer 
Orange and Red Fluorescent Proteins 
Kusabira 548 559 51,600 0.60 Monomer 
Orange 
mOrange 548 562 71,000 0.69 Monomer 
dTomato 554 581 69,000 0.69 Dimer 
dTomato- 554 581 138,000 0.69 Monomer 
Tandem 
DsRed 558 583 75,000 0.79 Tetramer 
DsRed2 563 582 43,800 0.55 Tetramer 
DsRed- 555 584 38,000 0.51 Tetramer 
Express (T1) 
DsRed- 556 586 35,000 0.10 Monomer 
Monomer 
mTangerine 568 585 38,000 0.30 Monomer 
mStrawberry 574 596 90,000 0.29 Monomer 
AsRed2 576 592 56,200 0.05 Tetramer 
mRFP1 584 607 50,000 0.25 Monomer 
JRed 584 610 44,000 0.20 Dimer 
mCherry 587 610 72,000 0.22 Monomer 
HcRed1 588 618 20,000 0.015 Dimer 
mRaspberry 598 625 86,000 0.15 Monomer 
HcRed- 590 637 160,000 0.04 Monomer 
Tandem 
mPlum 590 649 41,000 0.10 Monomer 
AQ143 595 655 90,000 0.04 Tetramer 
*Weak Dimer 
[0177] D. Allelic Variants 
[0178] In certain aspects of the invention, the set of cell 


lines or the expression cassettes may further comprise addi- 
tional coding sequence for drug metabolizing enzyme or drug 
targets and variants thereof. One benefit of using pluripotent 
stem cells is the ability to make cells that are identical in all 
respects, except that they have a particular variation in the 
gene encoding a drug metabolizing enzyme or drug target of 
particular interest. This is relevant in the context of drug 


Relative 
Brightness 
(% of 
EGFP) 


110 
116 
121 


117 


screening, because there are some naturally occurring allelic 
variants that affect an individual’s ability to respond to or 
metabolize drugs of a particular class. Because the cells are 
otherwise the same, the user can determine the effect of the 
compound being screened in an allotype specific manner. See 
published U.S. patent application 2003/0003573. 

[0179] Examples of drug metabolizing enzymes having 
known allelic variants of consequence are described by Wolf 
et al., 2000; Wolf et al., 1999; and Webber, 1997. 
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TABLE 3 


Naturally Occurring Allotype Variants of Drug Metabolizing Enzymes 


Total 
Variant No. of Exemplary 
Enzyme phenotype Frequency Drugs Substrates 
CYP2D6 poor White 696; 7100 codeine, 
metabolizer African American 2%; nortryptiline, 
Oriental 1% dextro- 
methorphan 
ultra-rapid Ethiopian 20%; 
metabolizer Spanish 7%; 
Scandinavian 1.5% 
CYP2C9 reduced 760 tolbutamide, 
activity diazepam, 
ibuprophen, 
warfarin 
CYP2C19 poor Oriental 2396; 750 mephenytoin, 
metabolizer White 4% omeprazole, 
proguanil, 
citalopram 
N-acetyl poor White 6096; >15 isoniazid, 


transferase metabolizer African American 60%; 


Oriental 20%; 


procainamaide, 
sulphonamides, 


Inuit 5% hydralazines 
Thiopurine poor low in all populations <10 6-mercapto- 
methyl- metabolizer purine, 6- 
transferase thioguanine, 
azathioprine 
[0180] Another enzyme with known variants is CYP3A4, 


which plays a role in deactivating testosterone, and which is 
implicated in susceptibility to prostate cancer (Paris et al., 
1999). 

[0181] To put into effect this embodiment of the invention, 
pluripotent stem cells may be divided into two or more sepa- 
rate subsets. One or more of the cell lines may be genetically 
altered to introduce a variant of the gene for the drug metabo- 
lizing enzyme or drug target (before or after introduction of 
the exogenous expression cassette). The gene can be intro- 
duced by random transduction, but more typically the variant 
is substituted for the native gene by homologous recombina- 
tion. This both silences the endogenous gene, and places the 
variant under control of condition-responsive regulatory ele- 
ments, for example, cell-specific or inducible promoters. 
Alternatively, if a naturally occurring variant is known to 
differ from the usual gene by a point mutation, the endog- 
enous gene can be mutated so as to conferthe same phenotype 
while introducing a condition-responsive transcription regu- 
latory element for regulating the variant expression. The user 
has the option of altering the opposite allele to express the 
same variant, or inactivating it, for example, by homologous 
recombination. 

[0182] Thecells could then differentiated and used for drug 
screening as described in the sections that follow. 


VI. DELIVERY OF GENE OR GENE PRODUCTS 


[0183] In certain embodiments, vectors for delivery of 
nucleic acids encoding exogenous condition-responsive 
expression cassettes could be constructed to express these 
factors in cells. In a particular aspect, the following systems 
and methods may be used in delivery of expression cassette 
for identification of desired cell types. In particular, a set of 
stem cell lines may comprise a set of different expression 
cassettes, each expression cassette under the control of a 
different condition-responsive regulatory element for expres- 
sion in responsive to a defined condition, such as differentia- 
tion to a defined cell lineage. 
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[0184] B. Homologous Recombination 

[0185] In certain aspects of the invention, the exogenous 
expression cassettes such as condition-responsive expression 
cassettes or reprogramming cassettes may be introduced into 
cells in a specific manner, for example, via homologous 
recombination. Current approaches to express genes in stem 
cells have involved the use of viral vectors or transgenes that 
integrate randomly in the genome. These approaches have not 
been successful due in part because the randomly integrated 
vectors can activate or suppress endogenous gene expression, 
and/or the silencing of transgene expression. The problems 
associated with random integration could be partially over- 
come by homologous recombination to a specific locus in the 
target genome, e.g., a Rosa26 locus. The Rosa26 locus is 
easily accessible and amenable to homologous recombina- 
tion. Transgenes targeted by homologous recombination to 
the Rosa26 locus are stably and efficiently expressed in the 
undifferentiated cells as well as the differentiated cell types 
generated from stem cells such as mouse or human pluripo- 
tent stem cells. 

[0186] Homologous recombination (HR), also known as 
general recombination, is a type of genetic recombination 
used in all forms of life in which nucleotide sequences are 
exchanged between two similar or identical strands of DNA. 
The technique has been the standard method for genome 
engineering in mammalian cells since the mid 1980s. The 
process involves several steps of physical breaking and the 
eventual rejoining of DNA. This process is most widely used 
to repair potentially lethal double-strand breaks in DNA. In 
addition, homologous recombination produces new combi- 
nations of DNA sequences during meiosis, the process by 
which eukaryotes make germ cells like sperm and ova. These 
new combinations of DNA represent genetic variation in off- 
spring which allow populations to evolutionarily adapt to 
changing environmental conditions over time. Homologous 
recombination is also used in horizontal gene transfer to 
exchange genetic material between different strains and spe- 
cies of bacteria and viruses. Homologous recombination is 
also used as a technique in molecular biology for introducing 
genetic changes into target organisms. 

[0187] Homologous recombination can be used as targeted 
genome modification. The efficiency of standard HR in mam- 
malian cells is only 107? to 107? of cells treated (Capecchi, 
1990). The use of meganucleases, or homing endonucleases, 
suchas I-Scel have been used to increase the efficiency of HR. 
Both natural meganucleases as well as engineered meganu- 
cleases with modified targeting specificities have been uti- 
lized to increase HR efficiency (Pingoud and Silva, 2007; 
Chevalier et al., 2002). Another path toward increasing the 
efficiency of HR has been to engineer chimeric endonu- 
cleases with programmable DNA specificity domains (Silva 
etal., 2011). Zinc-finger nucleases (ZEN) are one example of 
such a chimeric molecule in which Zinc-finger DNA binding 
domains are fused with the catalytic domain of a Type IIS 
restriction endonuclease such as Fok] (as reviewed in Durai et 
al, 2005; PCT/US2004/030606). Another class of such 
specificity molecules includes Transcription Activator Like 
Effector (TALE) DNA binding domains fused to the catalytic 
domain of a Type IIS restriction endonuclease such as FokI 
(Miller et al., 2011: PCT/IB2010/000154). 

[0188] C. Nuclei Acid Delivery Systems 


[0189] One of skill in the art would be well equipped to 
construct a vector through standard recombinant techniques 
(see, for example, Sambrook et al., 2001 and Ausubel et al., 
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1996, both incorporated herein by reference). Vectors include 
but are not limited to, plasmids, cosmids, viruses (bacterioph- 
age, animal viruses, and plant viruses), and artificial chromo- 
somes (e.g., YACs), such as retroviral vectors (e.g., derived 
from Moloney murine leukemia virus vectors (MoMLV), 
MSCV, SFFV, MPSV, SNV etc), lentiviral vectors (e.g., 
derived from HIV-1, HIV-2, SIV, BIV, FIV etc.), adenoviral 
(Ad) vectors including replication competent, replication 
deficient and gutless forms thereof, adeno-associated viral 
(AAV) vectors, simian virus 40 (SV-40) vectors, bovine pap- 
illoma virus vectors, Epstein-Barr virus, herpes virus vectors, 
vaccinia virus vectors, Harvey murine sarcoma virus vectors, 
murine mammary tumor virus vectors, Rous sarcoma virus 
vectors. 

[0190] 2. Episomal Vectors 

[0191] The use of plasmid- or liposome-based extra-chro- 
mosomal (i.e., episomal) vectors may be also provided in 
certain aspects of the invention, for example, for reprogram- 
ming of somatic cells. Such episomal vectors may include, 
e.g., oriP-based vectors, and/or vectors encoding a derivative 
of EBNA-1. These vectors may permit large fragments of 
DNA to be introduced to a cell and maintained extra-chro- 
mosomally, replicated once per cell cycle, partitioned to 
daughter cells efficiently, and elicit substantially no immune 
response. 

[0192] In particular, EBNA-1, the only viral protein 
required for the replication of the oriP-based expression vec- 
tor, does not elicit a cellular immune response because it has 
developed an efficient mechanism to bypass the processing 
required for presentation of its antigens on MHC class I 
molecules (Levitskaya et al., 1997). Further, EBNA-1 can act 
in trans to enhance expression of the cloned gene, inducing 
expression of a cloned gene up to 100-fold in some cell lines 
(Langle-Rouault et al., 1998; Evans et al., 1997). Finally, the 
manufacture of such oriP-based expression vectors is inex- 
pensive. 

[0193] Other extra-chromosomal vectors include other 
lymphotrophic herpes virus-based vectors. Lymphotrophic 
herpes virus is a herpes virus that replicates in a lymphoblast 
(e.g., a human B lymphoblast) and becomes a plasmid for a 
part of its natural life-cycle. Herpes simplex virus (HSV) is 
not a “lymphotrophic” herpes virus. Exemplary lym- 
photrophic herpes viruses include, but are not limited to EBV, 
Kaposi’s sarcoma herpes virus (KSHV); Herpes virus saimiri 
(HS) and Marek’s disease virus (MDV). Also other sources of 
episome-base vectors are contemplated, such as yeast ARS, 
adenovirus, SV40, or BPV. 

[0194] One of skill in the art would be well equipped to 
construct a vector through standard recombinant techniques 
(see, for example, Maniatis et al., 1988 and Ausubel et al., 
1994, both incorporated herein by reference). 

[0195] Vectors can also comprise other components or 
functionalities that further modulate gene delivery and/or 
gene expression, or that otherwise provide beneficial proper- 
ties to the targeted cells. Such other components include, for 
example, components that influence binding or targeting to 
cells (including components that mediate cell-type or tissue- 
specific binding); components that influence uptake of the 
vector nucleic acid by the cell; components that influence 
localization of the polynucleotide within the cell after uptake 
(such as agents mediating nuclear localization); and compo- 
nents that influence expression of the polynucleotide. 

[0196] Such components also might include markers, such 
as detectable and/or selection markers that can be used to 
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detect or select for cells that have taken up and are expressing 
the nucleic acid delivered by the vector. Such components can 
be provided as a natural feature of the vector (such as the use 
of certain viral vectors which have components or function- 
alities mediating binding and uptake), or vectors can be modi- 
fied to provide such functionalities. A large variety of such 
vectors are known in the art and are generally available. When 
a vector is maintained in a host cell, the vector can either be 
stably replicated by the cells during mitosis as an autonomous 
structure, incorporated within the genome of the host cell, or 
maintained in the host cell’s nucleus or cytoplasm. 

[0197] 3. Transposon-Based System 

[0198] According to a particular embodiment the introduc- 
tion of nucleic acids may use a transposon—transposase sys- 
tem. The used transposon—transposase system could be the 
well known Sleeping Beauty, the Frog Prince transposon— 
transposase system (for the description of the latter see e.g., 
EP1507865), or the TTA A-specific transposon piggyBac sys- 
tem. 

[0199]  Transposons are sequences of DNA that can move 
around to different positions within the genome of a single 
cell, a process called transposition. In the process, they can 
cause mutations and change the amount of DNA in the 
genome. Transposons were also once called jumping genes, 
and are examples of mobile genetic elements. 

[0200] There are a variety of mobile genetic elements, and 
they can be grouped based on their mechanism of transposi- 
tion. Class I mobile genetic elements, or retrotransposons, 
copy themselves by first being transcribed to RNA, then 
reverse transcribed back to DNA by reverse transcriptase, and 
then being inserted at another position in the genome. Class II 
mobile genetic elements move directly from one position to 
another using a transposase to “cut and paste" them within the 
genome. 

[0201] 4. Viral Vectors 

[0202] In generating recombinant viral vectors, non-essen- 
tial genes are typically replaced with a gene or coding 
sequence for a heterologous (or non-native) protein. Viral 
vectors are a kind of expression construct that utilizes viral 
sequences to introduce nucleic acid and possibly proteins into 
acell. The ability ofcertain viruses to infect cells or enter cells 
via receptor-mediated endocytosis, and to integrate into host 
cell genome and express viral genes stably and efficiently 
have made them attractive candidates for the transfer of for- 
eign nucleic acids into cells (e.g., mammalian cells). Non- 
limiting examples of virus vectors that may be used to deliver 
a nucleic acid of certain aspects of the present invention are 
described below. 

[0203] Retroviruses have promise as gene delivery vectors 
due to their ability to integrate their genes into the host 
genome, transferring a large amount of foreign genetic mate- 
rial, infecting a broad spectrum of species and cell types and 
of being packaged in special cell-lines (Miller, 1992). 
[0204] In order to construct a retroviral vector, a nucleic 
acid is inserted into the viral genome in the place of certain 
viral sequences to produce a virus that is replication-defec- 
tive. In order to produce virions, a packaging cell line con- 
taining the gag, pol, and env genes but without the LTR and 
packaging components is constructed (Mann et al., 1983). 
When a recombinant plasmid containing a cDNA, together 
with the retroviral LTR and packaging sequences is intro- 
duced into a special cell line (e.g., by calcium phosphate 
precipitation for example), the packaging sequence allows 
the RNA transcript of the recombinant plasmid to be pack- 
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aged into viral particles, which are then secreted into the 
culture media (Nicolas and Rubenstein, 1988; Temin, 1986; 
Mann et al., 1983). The media containing the recombinant 
retroviruses is then collected, optionally concentrated, and 
used for gene transfer. Retroviral vectors are able to infect a 
broad variety of cell types. However, integration and stable 
expression require the division of host cells (Paskind et al., 
1975). 

[0205] Lentiviruses are complex retroviruses, which, in 
addition to the common retroviral genes gag, pol, and env, 
contain other genes with regulatory or structural function. 
Lentiviral vectors are well known in the art (see, for example, 
Naldini et al., 1996; Zufferey et al., 1997; Blomer et al., 1997; 
U.S. Pat. Nos. 6,013,516 and 5,994,136). 

[0206] Recombinant lentiviral vectors are capable of 
infecting non-dividing cells and can be used for both in vivo 
and ex vivo gene transfer and expression of nucleic acid 
sequences. For example, recombinant lentivirus capable of 
infecting a non-dividing cell wherein a suitable host cell is 
transfected with two or more vectors carrying the packaging 
functions, namely gag, pol and env, as well as rev and tat is 
described in U.S. Pat. No. 5,994,136, incorporated herein by 
reference. 

[0207] D. Nucleic Acid Delivery 

[0208] Introduction ofa nucleic acid, such as DNA or RNA, 
into cells to be programmed with the current invention may 
use any suitable methods for nucleic acid delivery for trans- 
formation ofa cell., as described herein or as would be known 
to one of ordinary skill in the art. Such methods include, but 
are not limited to, direct delivery of DNA such as by ex vivo 
transfection (Wilson et al., 1989, Nabel et al, 1989), by injec- 
tion (U.S. Pat. Nos. 5,994,624, 5,981,274, 5,945,100, 5,780, 
448, 5,736,524, 5,702,932, 5,656,610, 5,589,466 and 5,580, 
859, each incorporated herein by reference), including 
microinjection (Harland and Weintraub, 1985; U.S. Pat. No. 
5,789,215, incorporated herein by reference); by electropo- 
ration (U.S. Pat. No. 5,384,253, incorporated herein by ref- 
erence; Tur-Kaspa et al., 1986; Potter et al., 1984); by calcium 
phosphate precipitation (Graham and Van Der Eb, 1973; 
Chen and Okayama, 1987; Rippe et al., 1990); by using 
DEAE-dextran followed by polyethylene glycol (Gopal, 
1985); by direct sonic loading (Fechheimer et al., 1987); by 
liposome mediated transfection (Nicolau and Sene, 1982; 
Fraley et al., 1979; Nicolau et al., 1987; Wong et al., 1980; 
Kaneda et al., 1989; Kato et al., 1991) and receptor-mediated 
transfection (Wu and Wu, 1987; Wu and Wu, 1988); by micro- 
projectile bombardment (PCT Application Nos. WO 
94/09699 and 95/06128; U.S. Pat. Nos. 5,610,042; 5,322,783 
5,563,055, 5,550,318, 5,538,877 and 5,538,880, and each 
incorporated herein by reference); by agitation with silicon 
carbide fibers (Kaeppler et al., 1990; U.S. Pat. Nos. 5,302,523 
and 5,464,765, each incorporated herein by reference); by 
Agrobacterium-mediated transformation (U.S. Pat. Nos. 
5,591,616 and 5,563,055, each incorporated herein by refer- 
ence); by desiccation/inhibition-mediated DNA uptake (Pot- 
rykus et al., 1985), and any combination of such methods. 
Through the application of techniques such as these, 
organelle(s), cell(s), tissue(s) or organism(s) may be stably or 
transiently transformed. 

[0209] 2. Liposome-Mediated Transfection 

[0210] Ina certain embodiment of the invention, a nucleic 
acid may be entrapped in a lipid complex such as, for 
example, a liposome. Liposomes are vesicular structures 
characterized by a phospholipid bilayer membrane and an 
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inner aqueous medium. Multilamellar liposomes have mul- 
tiple lipid layers separated by aqueous medium. They form 
spontaneously when phospholipids are suspended in an 
excess of aqueous solution. The lipid components undergo 
self-rearrangement before the formation of closed structures 
and entrap water and dissolved solutes between the lipid 
bilayers (Ghosh and Bachhawat, 1991). Also contemplated is 
an nucleic acid complexed with Lipofectamine (Gibco BRL) 
or Superfect (Qiagen). The amount of liposomes used may 
vary upon the nature of the liposome as well as the cell used, 
for example, about 5 to about 20 ug vector DNA per 1 to 10 
million of cells may be contemplated. 


[0211] Liposome-mediated nucleic acid delivery and 
expression of foreign DNA in vitro has been very successful 
(Nicolau and Sene, 1982; Fraley et al., 1979; Nicolau et al., 
1987). The feasibility of liposome-mediated delivery and 
expression of foreign DNA in cultured chick embryo, HeLa 
and hepatoma cells has also been demonstrated (Wong et al., 
1980). 


[0212] Incertain embodiments ofthe invention, a liposome 
may be complexed with a hemagglutinating virus (HVJ). This 
has been shown to facilitate fusion with the cell membrane 
and promote cell entry of liposome-encapsulated DNA 
(Kaneda et al., 1989). In other embodiments, a liposome may 
be complexed or employed in conjunction with nuclear non- 
histone chromosomal proteins (HMG-1) (Kato et al., 1991). 
In yet further embodiments, a liposome may be complexed or 
employed in conjunction with both HVJ and HMG-1. In other 
embodiments, a delivery vehicle may comprise a ligand and a 
liposome. 


[0213] 


[0214] In certain embodiments of the present invention, a 
nucleic acid is introduced into an organelle, a cell, a tissue or 
anorganism via electroporation. Electroporation involves the 
exposure of a suspension of cells and DNA to a high-voltage 
electric discharge. Recipient cells can be made more suscep- 
tible to transformation by mechanical wounding. Also the 
amount of vectors used may vary upon the nature of the cells 
used, for example, about 5 to about 20 ug vector DNA per 1 to 
10 million of cells may be contemplated. 


[0215] Transfection of eukaryotic cells using electropora- 
tion has been quite successful. Mouse pre-B lymphocytes 
have been transfected with human kappa-immunoglobulin 
genes (Potter et al., 1984), and rat hepatocytes have been 
transfected with the chloramphenicol acetyltransferase gene 
(Tur-Kaspa et al., 1986) in this manner. 


[0216] 4. Calcium Phosphate 


[0217] In other embodiments of the present invention, a 
nucleic acid is introduced to the cells using calcium phos- 
phate precipitation. Human KB cells have been transfected 
with adenovirus 5 DNA (Graham and Van Der Eb, 1973) 
using this technique. Also in this manner, mouse L(A9), 
mouse C127, CHO, CV-1, BHK, NIH3T3 and HeLa cells 
were transfected with a neomycin marker gene (Chen and 
Okayama, 1987), and rat hepatocytes were transfected with a 
variety of marker genes (Rippe et al., 1990). 


[0218] 5. DEAE-Dextran 


[0219] In another embodiment, a nucleic acid is delivered 
into a cell using DEAE-dextran followed by polyethylene 
glycol. In this manner, reporter plasmids were introduced into 
mouse myeloma and erythroleukemia cells (Gopal, 1985). 


3. Electroporation 
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VII. CELL CULTURING 


[0220] Generally, cells ofthe present invention are cultured 
inaculture medium, which is a nutrient-rich buffered solution 
capable of sustaining cell growth. 

[0221] Culture media suitable for isolating, expanding and 
differentiating stem cells according to the method described 
herein include but not limited to high glucose Dulbecco's 
Modified Eagle's Medium (DMEM), DMEM/F-15, Liebo- 
vitz L-15, RPMI 1640, Iscove's modified Dubelcco's media 
(IMDM), and Opti-MEM SFM (Invitrogen Inc.). Chemically 
Defined Medium comprises a minimum essential medium 
such as Iscove's Modified Dulbecco's Medium (IMDM) 
(Gibco), supplemented with human serum albumin, human 
Ex Cyte lipoprotein, transfernin, insulin, vitamins, essential 
and non essential amino acids, sodium pyruvate, glutamine 
and a mitogen is also suitable. As used herein, a mitogen 
refers to an agent that stimulates cell division of a cell. An 
agent can be a chemical, usually some form of a protein that 
encourages a cell to commence cell division, triggering mito- 
sis. In one embodiment, serum free media such as those 
described in U.S. Ser. No. 08/464,599 and WO96/39487, and 
the *complete media" as described in U.S. Pat. No. 5,486,359 
are contemplated for use with the method described herein. In 
some embodiments, the culture medium is supplemented 
with 1096 Fetal Bovine Serum (FBS), human autologous 
serum, human AB serum or platelet rich plasma supple- 
mented with heparin (2 U/ml). Cell cultures may be main- 
tained in a CO, atmosphere, e.g., 596 to 1296, to maintain pH 
ofthe culture fluid, incubated at 37? C. ina humid atmosphere 
and passaged to maintain a confluence below 8596. 

[0222] Pluripotent stem cells to be differentiated may be 
cultured in a medium sufficient to maintain the pluripotency. 
Culturing ofinduced pluripotent stem (iPS) cells generated in 
certain aspects of this invention can use various medium and 
techniques developed to culture primate pluripotent stem 
cells, more specially, embryonic stem cells, as described in 
U.S. Pat. App. 20070238170 and U.S. Pat. App. 
20030211603. For example, like human embryonic stem 
(hES) cells, iPS cells can be maintained in 80% DMEM 
(Gibco #10829-018 or #11965-092), 20% defined fetal 
bovine serum (FBS) not heat inactivated, 1% non-essential 
amino acids, 1 mM L-glutamine, and 0.1 mM .beta.-mercap- 
toethanol. Alternatively, ES cells can be maintained in serum- 
free medium, made with 80% Knock-Out DMEM (Gibco 
#10829-018), 20% serum replacement (Gibco #10828-028), 
1% non-essential amino acids, 1 mM L-glutamine, and 0.1 
mM .beta.-mercaptoethanol. Just before use, human bFGF 
may be added to a final concentration of about 4 ng/mL (WO 
99/20741). 


VIII. DIFFERENTIATING CELLS TO A DESIRED 
TISSUE TYPE 


[0223] Once the pluripotent stem cells have been intro- 
duced with the exeogenous expression cassettes designed for 
condition-responsive expression in the test cell population, 
the population can be bulked up to any extent required, and 
then differentiated at will into the desired tissue type. 

[0224] B. Liver Cells 

[0225] Hepatocytes can be differentiated from pluripotent 
stem cells such as hES cells using an inhibitor of histone 
deacetylase, as described in U.S. Pat. No. 6,458,589 and PCT 
publication WO 01/81549 (Geron Corporation). Undifferen- 
tiated pluripotent stem cells may be cultured in the presence 
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of an inhibitor of histone deacetylase. In an exemplary 
method, differentiation is initiated with 1% DMSO, then with 
2.5 mM of the histone deacetylase inhibitor n-butyrate. The 
cells obtained can be matured by culturing 4 days in a hepa- 
tocyte culture medium containing n-butyrate, DMSO, plus 
growth factors such as EGF, hepatocyte growth factor, and 
TGF-a. 


[0226] Staged protocols for differentiating pluripotent stem 
cells such as hES cells into hepatocytes are described in US 
2005/0037493 A1 (Geron Corp.). Cells are cultured with 
several combinations of differentiation and maturation agents 
in sequence, causing the pluripotent stem cells such as hES 
cells to differentiate first into early endoderm or hepatocyte 
precursors, and then to mature hepatocyte-like cells. 

[0227] Differentiation into endoderm-like cells can be ini- 
tiated using either butyrate, DMSO or fetal bovine serum, 
optionally in combination with fibroblast growth factors. Dif- 
ferentiation can then continue using a commercially available 
hepatocyte culture medium, including factors such as hepa- 
tocyte growth factor (HGF), epidermal growth factor (EGF), 
and/or bone morphogenic protein (e.g., BMP-2, 4, or 7) in 
various combinations. Final maturation may be enhanced by 
the presence of agents such as dexamethazone or Oncostatin 
M. An illustration of the “DMSO Protocol” from US 2005/ 
0037493 A1, as applied to the reporter hepatocytes of this 
invention, is provided below in Example 3. Ina refined hepa- 
tocyte differentiation protocol, differentiation is initiated 
using a protein with Activin activity, typically in the presence 
of or sequentially with other factors like butyrate and/or 
DMSO (Example 6). The cells can then be matured in stages, 
using HGF, EGF, and/or BMP, enhanced by the presence of 
agents such as dexamethazone followed by Oncostatin M. 
[0228] Hepatocytes in certain aspects of this invention can 
be made by culturing pluripotent stem cells or other non- 
hepatocytes in a medium under conditions that increase the 
intracellular level of hepatocyte programming factors to be 
sufficient to promote programming of the cells into hepato- 
cytes (see U.S. Application No. 61/323,689, incorporated 
herein by reference). The medium may also contain one or 
more hepatocyte differentiation and maturation agents, like 
various kinds of growth factors. However, by increasing the 
intracellular level of hepatocyte programming transcription 
factors, aspects of the present invention bypass most stages 
toward mature hepatocytes without the need to change the 
medium for each of the stages. Therefore, in view of the 
advantages provided by the present invention, in particular 
aspects, the medium for culturing cells under hepatocyte pro- 
gramming may be essentially free of one or more of the 
hepatocyte differentiation and maturation agents, or may not 
undergo serial change with media containing different com- 
bination of such agents. 

[0229] These agents may either help induce cells to commit 
to a more mature phenotype—or preferentially promote sur- 
vival of the mature cells—or have a combination of both these 
effects. Hepatocyte differentiation and maturation agents 
illustrated in this disclosure may include soluble growth fac- 
tors (peptide hormones, cytokines, ligand-receptor com- 
plexes, and other compounds) that are capable of promoting 
the growth of cells of the hepatocyte lineage. Non-limiting 
examples of such agents include but are not limited to epider- 
mal growth factor (EGF), insulin, TGF-a, TGF-, fibroblast 
growth factor (FGF), heparin, hepatocyte growth factor 
(AGF), Oncostatin M (OSM), IL-1, IL-6, insulin-like growth 
factors I and II (IGF-I, IGF-2), heparin binding growth factor 
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1 (HBGF-1), and glucagon. The skilled reader will already 
appreciate that Oncostatin M is structurally related to Leuke- 
mia inhibitory factor (LIF), Interleukin-6 (IL-6), and ciliary 
neurotrophic factor (CNTF). 

[0230] An additional example is n-butyrate, as described in 
previous patent disclosures (U.S. Pat. No. 6,458,589, U.S. 
Pat. No. 6,506,574; WO 01/81549). Homologs of n-butyrate 
can readily be identified that have a similar effect, and can be 
used as substitutes in the practice of this invention. Some 
homologs have similar structural and physicochemical prop- 
erties to those of n-butyrate: acidic hydrocarbons comprising 
3-10 carbon atoms, and a conjugate base selected from the 
group consisting of a carboxylate, a sulfonate, a phosphonate, 
and other proton donors. Examples include isobutyric acid, 
butenoic acid, propanoic acid, other short-chain fatty acids, 
and dimethylbutyrate. Also included are isoteric hydrocarbon 
sulfonates or phosphonates, such as propanesulfonic acid and 
propanephosphonic acid, and conjugates such as amides, sac- 
charides, piperazine and cyclic derivatives. A further class of 
butyrate homologs is inhibitors of histone deacetylase. Non- 
limiting examples include trichostatin A, 5-azacytidine, 
trapoxin A, oxamflatin, FR901228, cisplatin, and MS-27- 
275. Another class of agents is organic solvents like DMSO. 
Alternatives with similar properties include but are not lim- 
ited to dimethylacetamide (DMA), hexmethylene bisaceta- 
mide, and other polymethylene bisacetamides. Solvents in 
this class are related, in part, by the property of increasing 
membrane permeability of cells. Also of interest are solutes 
such as nicotinamide. 

[0231] The term “hepatocyte” or “hepatocyte lineage cell” 
as used in this disclosure means a cell that has one or more, 
preferably at least three, and more preferably five or seven of 
the following characteristics: a,-antitrypsin; asialoglycopro- 
tein, glycogen storage, cytochrome P450 enzyme expression; 
glucose-6-phosphatase activity, low to negligible a-fetopro- 
tein, and morphological features of hepatocytes (cuboidal 
cells, possibly with canalicular spaces between them). Other 
features of mature hepatocytes isolated from human liver may 
be present, but are not required to qualify cells as hepatocytes 
within this definition. Assay methods for identifying cell 
markers are detailed in U.S. Pat. No. 6,458,589. A “hepato- 
cyte” of this invention may be but is not necessarily obtained 
by differentiating human embryonic stem cells, unless this is 
explicitly required. 

[0232] Inthe context of drug screening, the user may also 
wish to test the activity of particular drug metabolizing 
enzymes, such as cytochrome P450 enzymes. A convenient 
way of surveying the activity of cytochrome P450 is to com- 
bine the cells with a “cassette” of substrates: such as mida- 
zolam (metabolized by CYP3A4), tolbutamide (metabolized 
by CYP2C9) phenacetin (CYP1A2), and bufuralol 
(CYP2D6). Activity can be quantitated as being about 0.1, 1, 
or 10 times that of a reference cell line, such as HepG2 cells. 
A convenient way of monitoring metabolites of all the drugs 
in the cassette simultaneously is by GCMS. If desirable, the 
cells can be treated with compounds such as dexamethazone 
or Rifampicin before or during use in drug screening, so as to 
increase cytochrome P450 expression or activity in the cells. 
[0233] C. Nerve Cells 

[0234] Neural cells can be generated from pluripotent stem 
cells such as hES cells according to the method described in 
USS. Pat. No. 6,833,269; Carpenter et al., 2001; and WO 
03/000868 (Geron Corporation). Undifferentiated hES cells 
or embryoid body cells are cultured in a medium containing 
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one or more neurotrophins and one or more mitogens, gener- 
ating a cell population in which at least ~60% of the cells 
express A2BS, polysialylated NCAM, or Nestin and which is 
capable of at least 20 doublings in culture. Exemplary mito- 
gens are EGF, basic FGF, PDGF, and IGF-1. Exemplary 
neurotrophins are NT-3 and BDNF. The use of TGF-p Super- 
family Antagonists, or a combination of cAMP and ascorbic 
acid, can be used to increase the proportion of neuronal cells 
that are positive for tyrosine hydroxylase, a characteristic of 
dopaminergic neurons. The proliferating cells can then be 
caused to undergo terminal differentiation by culturing with 
neurotrophins in the absence of mitogen. 

[0235] Oligodendrocytes can be generated from pluripo- 
tent stem cells such as hES cells by culturing them as cell 
aggregates, suspended in a medium containing a mitogen 
such as FGF, and oligodendrocyte differentiation factors such 
as triiodothyronine, selenium, and retinoic acid. The cells are 
then plated onto a solid surface, the retinoic acid is with- 
drawn, and the population is expanded. Terminal differentia- 
tion can be effected by plating on poly-L-lysine, and remov- 
ing all growth factors. Populations can be obtained in which 
over 80% of the cells are positive for oligodendrocyte mark- 
ers NG2 proteoglycan, A2B5, and PDGFRa, and negative for 
the neuronal marker NeuN. See PCT publication WO 
04/007696 and Keirstead et al., 2005. Derivation of retinal 
pigment epithelial cells has also been reported (Klimanskaya 
et al., 2004). 

[0236] D. Heart Cells 

[0237] Cardiomyocytes or cardiomyocyte precursors can 
be generated from pluripotent stem cells such as hES cells 
according to the method provided in WO 03/006950. The 
cells are cultured in suspension with fetal calf serum or serum 
replacement, and optionally a cardiotrophic factor that affects 
DNA-methylation, such as 5-azacytidine. Alternatively, car- 
diomyocyte clusters can be generated by culturing on a solid 
substrate with Activin A, followed by culturing with a bone 
morphogenic protein like BMP4, and optionally by further 
culturing with an insulin-like growth factor like IGF-1. If 
desired, spontaneously contracting cells can then be sepa- 
rated from other cells in the population, by density centrifu- 
gation. 

[0238] Further process steps can include culturing the cells 
so as to form clusters known as Cardiac Bodies™, removing 
single cells, and then dispersing and reforming the Cardiac 
Bodies™ in successive iterations. Populations are obtained 
with a high proportion of cells staining positive for cTnl, 
cTnT, cardiac-specific myosin heavy chain (MHC), and the 
transcription factor Nkx2.5. See WO 03/006950, Xu et al., 
2002; and US 2005/0214939 A1 (Geron Corporation). 
[0239] E. Other Cell Types 

[0240] Islet cells can be differentiated from pluripotent 
stem cells such as hES cells (WO 03/050249, Geron Corp.) by 
initiating differentiation by culturing in a medium containing 
a combination of several factors selected from Activin A, a 
histone deacetylase inhibitor (such as butyrate), a mitogen 
(such as bFGF); and a TGF-f Superfamily antagonist (such as 
noggin). The cells can then be matured by culturing with 
nicotinamide, yielding a cell population in which at least 5% 
of the cells express Pdx1, insulin, glucagon, somatostatin, 
and pancreatic polypeptide. Cell clusters may form buds 
enriched for insulin producing cells, which can be recovered 
by filtering. See WO 03/050249 (Geron Corp.). 

[0241] Hematopoietic cells can be made by coculturing 
pluripotent stem cells such as hES cells with murine bone 
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marrow cells or yolk sac endothelial cells was used to gener- 
ate cells with hematopoietic markers (U.S. Pat. No. 6,280, 
718). Hematopoietic cells can also be made by culturing stem 
cells with hematogenic cytokines and a bone morphogenic 
protein, as described in US 2003/0153082 A1 and WO 
03/050251 (Robarts Institute). 

[0242] Mesenchymal progenitors and fibroblasts can be 
generated from pluripotent stem cells such as hES cells 
according to the method described in WO 03/004605. hES- 
derived mesenchymal cells can then be further differentiated 
into osteoblast lineage cells in a medium containing an osteo- 
genic factor, such as bone morphogenic protein (particularly 
BMP4), a ligand for a human TGF- receptor, or a ligand for 
a human vitamin D receptor (WO 03/004605; Sotile et al., 
2003). US 2004/0009589 A1 (Iskovitz-Elder et al.) and US 
2003/0166273 A1 (Kaufman et al., Wisconsin) report endot- 
helial cells derived from human embryonic stem cells. Chon- 
drocytes or their progenitors can be generated by culturing 
stem cells in microaggregates with effective combinations of 
differentiation factors listed in WO 03/050250 (Geron Corp.). 
[0243] Other differentiation methods known in the art or 
subsequently developed can be used in conjunction with this 
invention to create engineered cells representative of other 
tissues. 


IX. SCREENING PLATFORM AND METHODS 


[0244] The engineered cell population or cells derived 
therefrom in certain aspects of the invention can be used in a 
variety of applications. These include but not limited to study 
biological response or drug response; screening cytotoxic 
compounds, carcinogens, mutagens growth/regulatory fac- 
tors, pharmaceutical compounds, etc., in vitro; elucidating 
the mechanism or conditions of cell programming or devel- 
opment pathways; studying the mechanism by which drugs 
and/or growth factors operate; and the production of biologi- 
cally active products, to name but a few. 

[0245] B. Test Compound Screening 

[0246] Engineered cells or cells derived therefrom of cer- 
tain aspects of this invention can be used to screen for factors 
(such as solvents, small molecule drugs, peptides, and poly- 
nucleotides) or environmental conditions (such as culture 
conditions or manipulation) that affect the expression char- 
acteristics of exogenous expression caseettes comprising 
condition-responsive regulatory elements provided herein. 
[0247] In some applications, stem cells (differentiated or 
undifferentiated) are used to screen factors that promote 
maturation of cells along a selected cell lineage such as the 
hepatocyte lineage, or promote proliferation and mainte- 
nance of such cells in long-term culture. For example, candi- 
date hepatocyte maturation factors or growth factors are 
tested by adding them to stem cells in different wells, and then 
determining any phenotypic change that results, according to 
desirable criteria for further culture and use of the cells. 
[0248] Particular screening applications of this invention 
relate to the testing of pharmaceutical compounds in drug 
research. The reader is referred generally to the standard 
textbook In vitro Methods in Pharmaceutical Research, Aca- 
demic Press, 1997, and U.S. Pat. No. 5,030,015). In certain 
aspects ofthis invention, cells programmed to the hepatocyte 
lineage play the role of test cells for standard drug screening 
and toxicity assays, as have been previously performed on 
hepatocyte cell lines or primary hepatocytes in short-term 
culture. Assessment of the activity of candidate pharmaceu- 
tical compounds generally involves combining the hepato- 
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cytes provided in certain aspects of this invention with the 
candidate compound, determining any change in the mor- 
phology, marker phenotype, or metabolic activity of the cells 
that is attributable to the compound (compared with untreated 
cells or cells treated with an inert compound), and then cor- 
relating the effect ofthe compound with the observed change. 
The screening may be done either because the compound is 
designed to have a pharmacological effect on liver cells, or 
because a compound designed to have effects elsewhere may 
have unintended hepatic side effects. Two or more drugs can 
be tested in combination (by combining with the cells either 
simultaneously or sequentially), to detect possible drug-drug 
interaction effects. 

[0249] 2. Toxicity Testing 

[0250] | Use of the cells of this invention containing condi- 
tion-responsive expression cassettes in toxicity testing 
involves combining the cell population with the agent to be 
screened (typically by adding it to the medium). Examples of 
such a agent include, but need not be limited to, pharmaceu- 
tical compounds, agricultural chemicals, specialty chemicals, 
cosmetics and food additives. The effect of the agent on the 
exogenous expression cassette is followed typically by com- 
paring the signal from the marker gene in the presence and 
absence ofthe agent, using a detection system appropriate for 
the selectable or screenable marker chosen. 

[0251] By way of illustration, iPS cells are genetically 
modified and differentiated to create a population of hepato- 
cytes containing a promoter for heme oxygenase 1, linked to 
a green fluorescent protein reporter gene. The cells are com- 
bined with the test agent in the same medium, and fluores- 
cence is measured in comparison with fluorescence in the 
absence of the test agent. Increase in fluorescence level indi- 
cates that the heme oxygenase 1 gene is up-regulated, appar- 
ently in response to oxidative stress induced by the test agent. 
Different agents and agent combinations can be screened in a 
rapid throughput process, for example, by establishing the 
cells in the wells ofa microtiter plate. Agents tested according 
to this system can be identified and selected for further devel- 
opment, testing, or use because they do not cause substantial 
increase or alteration in the level of reporter expression 
(which means that if there is any effect attributable to the 
presence ofthe test agent, it is below a threshold that the user 
considers acceptable). 

[0252] Depending on the differentiation protocol, cell 
populations can be used that are at least 50%, 80%, or 90% 
homogeneous for the cell type of interest. Where the cell 
populations are relatively pure, or when the selected promoter 
is only active in the cell type of interest (e.g., the CYP3A4 
promoter in hepatocytes), then effects of the test agent on the 
target cell can be measured simply by following signal from 
the reporter gene in the cell population as a whole. 

[0253] However, when the cell populations are more het- 
erogeneous, and the promoter can be induced in more than 
one of the cell types present, then it may be preferable to 
follow the effect on a cell-by-cell basis. A cell that contains 
both a metabolic-responsive expression cassette and a tissue- 
specific expression cassette is equipped to do this particularly 
well. The test agent is combined with the cell population as a 
whole, but the output of the assay is measured as a change in 
the metabolic-responsive marker when present in a cell 
labeled with the tissue specific expression cassette. A benefit 
of this approach is that there is no need for the target cell type 
to predominate the reagent cell population. Populations com- 
prising less than 20%, 10%, or 5% of the target cells can be 
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used, since a drug-induced effect will be demonstrated if 
there are detectable cells in which both markers are 
expressed. This enables the drug screening techniques of this 
invention to be used with relatively rare cell types or sub- 
types—e.g., insulin-producing pancreatic islet cells, or neural 
cells that utilize a particular neurotransmitter. 

[0254] Cell populations equipped with a plurality of meta- 
bolic ortoxicologically responsive expression cassette (either 
as different exogenous expression cassettes in a single cell 
line, or in a population of mixed cells containing different 
exogenous expression cassettes) can be used to monitor mul- 
tiple assault pathways simultaneously, as long as the products 
of the selectable or screenable marker genes are distinguish- 
able. 

[0255] In some particular applications, compounds are 
screened initially for potential hepatotoxicity (Castell et al., 
1997). Cytotoxicity can be determined in the first instance by 
the effect on cell viability, survival, morphology, and leakage 
ofenzymes into the culture medium. More detailed analysis is 
conducted to determine whether compounds affect cell func- 
tion (such as gluconeogenesis, ureogenesis, and plasma pro- 
tein synthesis) without causing toxicity. Lactate dehydroge- 
nase (LDH) is a good marker because the hepatic isoenzyme 
(type V) is stable in culture conditions, allowing reproducible 
measurements in culture supernatants after 12-24 h incuba- 
tion. Leakage of enzymes such as mitochondrial glutamate 
oxaloacetate transaminase and glutamate pyruvate transami- 
nase can also be used. Gomez-Lechon et al. (1996) describes 
a microassay for measuring glycogen, which can be used to 
measure the effect of pharmaceutical compounds on hepato- 
cyte gluconeogenesis. 

[0256] Other current methods to evaluate hepatotoxicity 
include determination of the synthesis and secretion of albu- 
min, cholesterol, and lipoproteins; transport of conjugated 
bile acids and bilirubin; ureagenesis; cytochrome p450 levels 
and activities; glutathione levels; release of a-glutathione 
s-transferase; ATP, ADP, and AMP metabolism; intracellular 
K* and Ca?* concentrations; the release of nuclear matrix 
proteins or oligonucleosomes; and induction of apoptosis 
(indicated by cell rounding, condensation of chromatin, and 
nuclear fragmentation). DNA synthesis can be measured as 
[°H]-thymidine or BrdU incorporation. Effects of a drug on 
DNA synthesis or structure can be determined by measuring 
DNA synthesis or repair. [*H]-thymidine or BrdU incorpora- 
tion, especially at unscheduled times in the cell cycle, or 
above the level required for cell replication, is consistent with 
a drug effect. Unwanted effects can also include unusual rates 
of sister chromatid exchange, determined by metaphase 
spread. The reader is referred to Vickers (1997) for further 
elaboration. 


[0257] 
[0258] Besides screening test compounds for toxicology, 
drug metabolism, and disposition, the cells of this invention 
can also be used to screen for positive pharmacological effect. 
For example, pancreatic cells containing a selectable or 
screenable marker system driven by an insulin promoter can 
be used to screen drugs capable of inducing insulin secretion. 
Neuronal cells containing a selectable or screenable marker 
system driven by promoters for genes in neurotransmitter 
synthesis, release, or uptake can be used to screen drugs with 
a potentially beneficial neurological effect. The use of cells, 
kits, and methodology of this invention for positive screening 
parallels that of toxicity testing, selecting appropriate pro- 
moter constructs and adapting the assays as appropriate. 


3. Screening for Positive Pharmacological Effect 
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[0259] In another example, compounds can be tested for 
cytoprotection against another drug or culture condition. For 
example, cells containing a exogenous expression cassette for 
a gene upregulated in apoptosis or stress (like PUMA or heme 
oxygenase 1) are cultured in the presence of stressors such as 
menadione, tertiary butylhydroquinone (TBHQ), hydroper- 
oxidase, quinone, or abnormal oxygen levels to turn on the 
selectable or screenable marker signal. Once established, 
cells cultured with such stressors can be used to screen drugs 
that will prevent, lower, or reverse selectable or screenable 
marker signaling, thereby denoting a lower level of gene 
expression, and hence a protective effect. This can be used 
with pluripotent stem cell-derived cardiomyocytes, for 
example, to test drugs for suitability in treating cardiac 
ischemia. In tandem with screening of drugs for positive 
effects, matched populations of hepatocyte reporter cells can 
be used to screen for toxicological effects of the same com- 
pounds. 


[0260] 4. Validation of Drug Targets and Drug Metaboliz- 
ing Enzymes 
[0261] During the course of screening fora toxicological or 


pharmaceutical effect, the user may wish to validate the pre- 
sumed target of a particular drug, or an enzyme believed to be 
involved in its metabolism. This can be done by combining 
the drug with exogenous expression cassette-containing cells 
in the presence or absence of a substance that either activates 
or inhibits transcription or translation of the drug target or 
metabolizing enzyme. The exogenous expression cassette is 
chosen to reflect gene activity downstream from the activity 
being tested. The user then determines whether there is a 
difference in expression of the selectable or screenable 
marker gene in the presence of the drug with or without the 
RNAi, as an indication of whether the drug does influence the 
drug target or enzyme in question. 

[0262] Suitable inhibitors for use in this context are RNA 
molecules (RNAi) of the single or double stranded variety, 
having a sequence that enables it to inactivate translation in a 
gene specific manner. The synthesis and use of RNAi mol- 
ecules and other inhibitors suitable for use in this context are 
well described in the art. See, for example, Huan et al., Cancer 
Res. 64:4294, 2004; Chan et al., 2005; Manoharan, 2004; WO 
04/094595; WO 05/014782). Other suitable activators and 
inhibitors include small molecule drugs known to upregulate 
or downregulate the gene at the transcription level (Campbell 
et al., 1996). 

[0263] Known drug targets include G protein-coupled 
receptors (GPCRs), activated by ligands like TNF; peroxi- 
some proliferation-activated receptors (PPARs), which binds 
muraglitazar and other compounds; cytochrome P450 regu- 
lators like PXR, which are activated by dexamethazone, 
Rifampicin, or pregnenalone 16a-carbonitrile; the nuclear 
receptor CAR, which are activated by phenobarbital and other 
barbiturates; Phase II enzymes like glycosyl transferase, 
which process polychlorinated biphenyl compounds; aryl 
hydrocarbon (Ah) receptors, which bind benzo[a]pyrene and 
p-naphthoflavone; and estrogen receptors, which bind estro- 
gen analogs like tamoxifen. 

[0264] Known drug metabolizing enzymes include the 
cytochrome P450 system (Ortiz de Montellano et al., supra), 
N-acetyl transferase, and enzymes involved in conjugation of 
bile acids and other compounds. 

[0265] To illustrate this aspect of the invention, drug 
metabolism in the liver can be studied using hepatocytes 
having an exogenous expression cassette that responds to 
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oxidative stress. A drug that is metabolized through the cyto- 
chrome P450 system (e.g., phenobarbital) can be combined 
withthe cells in the presence and absence of RNAi specific for 
particular P450 enzymes like CYP3A4. If there is higher 
selectable or screenable marker activity induced by the drug 
in the presence of the RNAi, then the reduction in CYP3A4 
activity caused by the RNAi is resulting in increased stress— 
implicating CYP3A4 in the metabolic pathway of the drug. 
[0266] Ina similar fashion, role of the estrogen receptor in 
the pharmaceutical activity of a drug can be evaluated using 
cells having an exogenous expression cassette that reflects 
transcription of a gene up-regulated by estrogen. If there is 
lower selectable or screenable marker activity induced by the 
drug in the presence of RNAi specific for the estrogen recep- 
tor, then the estrogen receptor is validated as a target for the 
drug being tested. 

[0267] 5. Effect on Allelic Variants 

[0268] Exogenous expression cassette-containing cells 
made from the same pluripotent stem cell line but engineered 
to contain different variants of a drug metabolizing enzyme 
can be used to compare the processing or effect of a drug 
thought to be metabolized by the enzyme. For example, hepa- 
tocytes derived from the same iPS cell having the usual form 
of the CYP2D6 gene, can be compared with hepatocytes 
having the variant present in 6% of the population for the 
effect of a drug like dextromethorphan. Differences in drug 
metabolism attributable to the variation will affect the signal 
generated through an exogenous expression cassette that 
responds to metabolic or toxicologic changes in the cell, or 
reflects expression of a gene product implicated in metabo- 
lism of the drug. 

[0269] Ina similar fashion, cells engineered to contain dif- 
ferent variants of a drug target can be used to compare the 
effect of a drug on the target variants. For example, neuronal 
cells having variations in an enzyme involved in uptake of a 
neurotransmitter can be compared for the effect of a drug 
known to affect uptake (e.g., bupropion). Differences in the 
pharmacological effect of the drug attributable to the varia- 
tion will affect the signal generated through an exogenous 
expression cassette that responds to presence of the neu- 
rotransmitter. 

[0270] Separate cell populations having different variants 
ofthe drug target or drug metabolizing enzyme can be tested 
with the drug in parallel. Optionally, each variant can be 
placed in a cell population having different selectable or 
screenable marker genes. This enables the user to combine 
the two cell populations, and measure the effect ofthe drug on 
both variants together. 

[0271] C. Cells and Methods for Testing Programming 
[0272] To aid identification of desired cell types, the cells 
that comprise a cell-specific or tissue-specific marker expres- 
sion cassette may be used to test programming conditions, 
more particularly, differentiation conditions. The expression 
cassette may comprise a selectable or screenable marker 
operably linked to a transcriptional regulatory element spe- 
cific for the desired cell types. For example, the expression 
cassette may comprise a hepatocyte-specific promoter for 
hepatocyte production, isolation, selection, or enrichment. 
[0273] Therefore, in certain aspects, the ability ofa particu- 
lar candidate gene or a combination of candidate genes to act 
as programming factors for a specific cell type, such as hepa- 
tocytes or novel cell types that have never been made from 
programming such as differentiation of pluripotent stem 
cells, can be tested using the methods and cells provided in 
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this disclosure. Efficacy of particular candidate genes or com- 
binations of candidate genes in programming can be assessed 
by their effect on cell morphology, marker expression, enzy- 
matic activity, proliferative capacity, or other features of 
interest, which is then determined in comparison with parallel 
cultures that did not include the candidate genes or combina- 
tions. Candidate genes may be transcription factors important 
for differentiation into desired cell types or for function ofthe 
desired cell types. 

[0274] Incertain embodiments, starting cells, such as pluri- 
potent stem cells comprising condition-responsive expres- 
sion cassettes, may further comprise at least one expression 
cassette for expression of a candidate gene or a combination 
of candidate genes. The candidate expression cassette may 
comprisean externally controllable transcriptional regulatory 
element, such as an inducible promoter. The activity ofthese 
promoters may be induced by the presence or absence of 
biotic or abiotic factors. Inducible promoters are a very pow- 
erful tool in genetic engineering because the expression of 
genes operably linked to them can be turned on or off at 
certain stages of development ofan organism or ina particular 
tissue. Tet-On and Tet-Off inducible gene expression systems 
based on the essential regulatory components of the E. coli 
tetracycline-resistance operon may be used. Once established 
in the starting cells, the inducer doxycycline (Dox, a tetracy- 
cline derivative) could controls the expression system in a 
dose-dependent manner, allowing to precisely modulate the 
expression levels of candidate genes. 


VIII. EXAMPLES 


[0275] The following examples are included to demon- 
strate preferred embodiments of the invention. It should be 
appreciated by those of skill in the art that the techniques 
disclosed in the examples which follow represent techniques 
discovered by the inventors to function well in the practice of 
the invention, and thus can be considered to constitute pre- 
ferred modes for its practice. However, those of skill in the art 
should, in light ofthe present disclosure, appreciate that many 
changes can be made in the specific embodiments which are 
disclosed and still obtain a like or similar result without 
departing from the spirit and scope of the invention. 


Example 1 


Production of Engineered Stem Cell Lines 


[0276] The inventors contemplated a collection of engi- 
neered stem cell lines that can be used as a set to study any 
biological response in the human body. The base cell line for 
the construction of the set is an episomally derived iPS cell 
line made from a human tissue sample. Into this episomal 
line, a homologous recombination strategy has been used to 
introduce recombinase recognition sites into the Rosa 26 
locus of the parental iPS line. This strategy, disclosed in 
published US Patent Application No. 20100011455, is illus- 
trated in FIG. 1. The Rosa 26 targeting cassette was made so 
as to insert in between intron I and II in the native Rosa 26 
locus on human chromosome 3. The cassette included, in 5' to 
3' sequence, a 5' homologous arm for targeting, a spacer, a 
recombinase recognition site (white triangle), a protein cod- 
ing sequence from the thymidine kinase gene beginning with 
an ATG to start transcription, a 2À sequence, a second protein 
coding sequence for an antibiotic resistance gene for resis- 
tance to neomycin, a second recombinase recognition site 
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(black triangle) and a 3' homologous arm. The cassette is 
designed to be introduced into cells of the iPS line, with 
successful desired recombinant events being identified by 
resistance to neomycin. This cassette has been successfully 
transferred into the Rosa 26 locus of an episomally repro- 
grammed iPS line. This basal Rosa 26 knock-in line has been 
verified by PCR, expanded and banked in aliquots. 


[0277] With the basal Rosa 26 knock-in iPS line in hand, it 
then becomes convenient to introduce any desired genetic 
construct into this site. Sincethe Rosa 26 locus is expressed in 
essentially all tissues, expression at the locus is not repressed 
regardless ofthe cell lineage into which the iPS line is differ- 
entiated. 


[0278] Also shown in FIG. 1 are the elements of a second- 
ary engineered iPS line, made from this basal Rosa 26 iPS 
line. This particular line is constructed for selection of hepa- 
tocytes. First, a genetic construct was assembled which con- 
tained two expression cassettes, one cassette to permit selec- 
tion of the desired recombinant event, and one cassette to 
permit tissue specific selection of the desired tissue type, i.e., 
hepatocytes. At the 5' end of the construct, there was a left 
recombination recognition site, followed by a protein coding 
sequence for another antibiotic resistance gene, designated 
herein as the iPS selector. This coding sequence is driven by 
the native Rosa 26 promoter to permit successful desired 
recombinant cells to be identified by resistance to the antibi- 
otic for which the iPS selector confers resistance. Also in the 
construct, oriented in the opposite direction, is a construct 
including the promoter of alpha-1 antitrypsin (pAAT), which 
drives the expression of a second antibiotic selection gene, 
this one to be used to select cells when the cells have differ- 
entiated into hepatocytes. In this particular construct, there 
are also several enhancer elements (designated as ApoE1-4) 
which have been found to enhance the expression level of this 
particular promoter in hepatocytes. This construct has been 
built, transfected into the basal Rosa 26 iPS line, and antibi- 
otic resistant colonies have been recovered. Subsequent char- 
acterization will identify the proper insertion events and those 
clones will be expanded and banked. 


[0279] This same strategy can then be used to make each of 
the lines in the collection of lines envisioned herein. Shown in 
FIG. 2 is an example of the common format of design of the 
genetic constructions to go into the iPS lines of the collection. 
For each insertion, there is an iPS selector which permits 
selection of the desired recombinant insertion. For each inser- 
tion, there is a tissue specific promoter, the promoters being 
different in different elements of the set, but each of the 
promoters selected for tissue specific expression. The tissue 
specific expression will be in some instances an organ, e.g., 
pan cardiac, in some instances an organ subtype, e.g., atrial 
cell, in some cases a body wide cell type, e.g., endothelial cell, 
or in some instances a level of differentiation, e.g., a cardiac 
progenitor. The tissue specific promoter actuates expression 
of a second gene for resistance to a second antibiotic resis- 
tance gene, labeled a cell type selector in FIG. 2. The cell type 
selector is used to purify the cells of interest by enabling the 
survival of the cells which express the tissue specific pro- 
moter. A marker gene, such as a fluorescent protein, 
luciferase, a proprietary marker system, such as HaloTag or 
SNAP, is linked in expression to the cell type selector by a 2A 
linker, which works to co-express two distinct proteins driven 
by a common promoter. The collection will have a large 
number of different iPS cell lines, each engineered with a 
different tissue specific promoter element so that each line 
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either reports (fluorescence) or is selectable (antibiotic resis- 
tance), or both, when the conditional responsive promoter 
element in its construct is active. 


[0280] With this set of lines each of which is pre-engi- 
neered to become purifiable differentiated cells of a selected 
lineage, it then becomes possible to tag or mark any desired 
drug target, cell receptor or pathway in the cells. The number 
of known druggable target and pathways of interest to the 
pharmaceutical industry is reasonably small, less than 100 
pathways and targets. Vectors for each of those targets and 
pathways will be assembled into piggyBac vectors which 
contain genetic constructs that will exhibit a marker gene, 
such as a second distinct fluorescent marker, when the target 
or pathway is active in the cell. piggyBac vectors can readily 
be transformed efficiently into iPS cells without silencing and 
clones containing the piggyBac vectors can readily be iden- 
tified which have appropriate expression of conditionally 
responsive promoters in the piggyBac vectors. The set of 
piggyBac vectors can then be mixed and matched as needed 
with the set of iPS lines. The result is that the set of iPS lines 
permits differentiation and purification of any cell type in the 
human body for which a tissue specific promoter can be 
identified and the use of the piggyBac vector permits screen- 
ing for any druggable target or pathway in those cells. This 
system thus enables drug screening to be done on any cell 
type in the human body on any target that a pharmaceutical 
discovery effort might desire. All of the cells of the body and 
all of the pathways in those cells are now available for drug 
discovery in the most appropriate biological context possible 
outside of the human body itself. 


[0281] Another use for this set of iPS lines is for the dis- 
covery of differentiation processes. As the science of stem 
cells advances, slowly methods are being found to differen- 
tiate stem cells into many differentiated cell types. The tool 
involving a set of iPS lines as described above enables that 
process to be dramatically accelerated. By using an iPS line 
which will express its inserted marker gene when the cell 
differentiates into a given progeny cell, it now becomes pos- 
sible to perform random or semi random screens on differen- 
tiation conditions, since any condition which causes the 
undifferentiated stem cells to differentiate into the target cells 
of interest can be detected by the activation of the marker gene 
of FIG. 2. Once a single differentiation method is identified, 
even if it works at low efficiency, the same tool permits 
recursive experimentation on the initial method to be per- 
formed to increase efficiency and yield, while at all levels of 
efficiency of the process, purified cultures of the cells of 
interest can be simply obtained by antibiotic purification. So 
developing processes to produce purified cultures of any cell 
type in the body is now possible. 


[0282] Another use for the set of tool lines is for use as an 
assay of developmental toxicology. Since the iPS cells will 
exhibit the marker gene only when they differentiate into the 
target cell type, once a differentiation process is working at 
some level of efficiency, it is then possible to pertubate that 
process, by adding molecules which are potential develop- 
mentally toxic, to see if the molecules influence the yield of 
the target cells. For example, using the hepatocyte (liver) 
example of FIG. 1, under processes favoring the differentia- 
tion of hepatocytes, the cell line of FIG. 1 will yield anywhere 
from 20 to 70% hepatocytes, the efficiency level of which can 
be measured by the observed fluorescence from the cells as 
they become hepatocytes. For any process, the level will vary 
somewhat, but vary within limits about a statistical norm. It 


US 2015/0219627 A1 Aug. 6, 2015 
26 


then becomes possible to set up that process in multiwell [0303] U.S. Pat. No. 5,563,055 
culture plates and to add a potential teratogen or other poten- [0304] U.S. Pat. No. 5,580,859 
tial developmentally toxic agent to each well. The wells that [0305] U.S. Pat. No. 5,589,466 
fail to produce the normal yield of hepatocytes would indicate [0306] U.S. Pat. No. 5,591,616 
that the agent used in those particular wells is potentially [0307] U.S. Pat. No. 5,610,042 
harmful to the development of that cell type. This system can [0308] U.S. Pat. No. 5,610,042 
be replicated for many different cell types to identify those [0309] U.S. Pat. No. 5,635,387 
known or new agents which interfere with any form of devel- [0310] U.S. Pat. No. 5,656,610 
opmental biology. [0311] U.S. Pat. No. 5,677,136 
[0283] The most exhaustive variant would be a set of per- [0312] U.S. Pat. No. 5,681,599 
haps 10,000 lines where every promoter in the body which is [0313] U.S. Pat. No. 5,702,932 
differentially expressed in any cell type is included (i.e., [0314] U.S. Pat. No. 5,716,827 
excluding all promoters express similarly in all or most cells). [0315] U.S. Pat. No. 5,736,396 
This set could then be used to follow the expression charac- [0316] U.S. Pat. No. 5,736,524 
teristics of any promoter in any developmental pathway. [0317] U.S. Pat. No. 5.150.397 
[0284] This design would enable high-throughput screen- [0318] U.S. Pat. No. 5,759,793 
ing using the luciferase reporter, high-content imaging or [0319] US. Pat. No. 5.780. 448 
high-throughput FACS using the GFP reporter, or purification [0320] US. Pa Ü No. 578921 5 
using the antibiotic resistance gene, the expression of all of chic Eee C 
RO i 2 [0321] U.S. Pat. No. 5,811,094 
which is controlled by the genetic regulatory element in the [0322] US. Pat. No. 5,827,735 
genetic construct. ORO ers a 
[0285] In addition, examination of the variation in gene [0323] U.S. Pat. No. 5,827,740 
expression patterns of particular cell types can be examined [0324] U.S. Pat. No. 5,837,539 
with this tool. The inventors will construct a set of iPS cell [0325] U.S. Pat. No. 5,837,670 
lines with all the promoters from all the liver P450 genes. — [0326] U.S. Pat. No. 5,925,565 
Then the iPS cells will be differentiated to hepatocytes and be [0327] U.S. Pat. No. 5,935,819 
used to track the spectrum of P450 responses to an applied [0328] U.S. Pat. No. 5,945,100 
drug. [0329] U.S. Pat. No. 5,981,274 
ofthe methods disclosed and claimed herein can .S. Pat. No. 5,994, 
[0286]  Allofth hods disclosed and claimed herei [0330] U.S. Pat. No. 5,994,136 
be made and executed without undue experimentation in light [0331] U.S. Pat. No. 5,994,624 
of the present disclosure. While the compositions and meth- [0332] U.S. Pat. No. 6,013,516 
ods ofthis invention have been described in terms of preferred [0333] U.S. Pat. No. 6,280,718 
embodiments, it will be apparent to those of skill in the art that [0334] U.S. Pat. No. 6,458,589 


variations may be applied to the methods and in the steps or in [0335] U.S. Pat. No. 6,458,589 
the sequence of steps of the method described herein without [0336] U.S. Pat. No. 6,458,589 
departing from the concept, spirit and scope of the invention. [0337] U.S. Pat. No. 6,506,574 
More specifically, it will be apparent that certain agents which [0338] U.S. Pat. No. 6,833.269 
are both chemically and physiologically related may be sub- [0339] U.S. Pat. No 6.333 260 
stituted for the agents described herein while the same or [0340] US. Pat. No. 6.901.807 
similar results would be achieved. All such similar substitutes [0341] US. par No. 20i 5037 
and modifications apparent to those skilled in the art are Se A noe: 
deemed to be within the spirit, scope and concept of the [0342] U.S. Pat. No. 7,399,632 
invention as defined by the appended claims. [0343] U.S. Pat. No. 7,410,773 
[0344] U.S. Pat. No. 7,410,798 
REFERENCES [0345] U.S. Pat. No. 7,422,736 


[0346] U.S. Appln. 61/323,689 
[0347] U.S. application Ser. No. 08/464,599 
[0348] U.S. Appln. 61/172,079 
[0349] U.S. Appln. 61/184,546 


[0287] The following references, to the extent that they 
provide exemplary procedural or other details supplementary 
to those set forth herein, are specifically incorporated herein 
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1.-48. (canceled) 

49. A method of testing an effect of a compound on at least 
two distinct cell types with the same genetic background 
comprising: 

(a) obtaining an induced pluripotent stem (iPS) cell line; 

(b) engineering the iPS cell line with a first exogenous 

expression cassette comprising a selectable or screen- 
able marker under the control of a drug-responsive regu- 
latory element thereby producing an engineered iPS cell 
line; 

(c) differentiating the engineered iPS cell line into at least 

two distinct cell types; 

(d) contacting the at least two distinct cell types with the 

compound; and 
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(e) detecting the expression of the selectable or screenable 
marker in the at least two distinct cell types, thereby 
testing the compound against the at least two distinct cell 
types. 

50. The method of claim 49, wherein the iPS cell line is a 

human iPS cell line. 

51. The method of claim 49, wherein the iPS cell line is 
essentially free of exogenous retroviral genetic elements. 

52. The method of claim 49, wherein the first exogenous 
expression cassette is integrated into the genome of the engi- 
neered iPS cell line. 

53. The method of claim 52, wherein the first exogenous 
expression cassette is comprised at a predetermined location 
of the genome of the engineered iPS cell line. 

54. The method of claim 53, wherein the first exogenous 
expression cassette is comprised in a transposon system. 

55. The method of claim 49, wherein the drug-responsive 
regulatory element comprises a drug receptor, drug target, or 
drug signaling pathway-responsive regulatory element. 

56. The method of claim 49, wherein the drug-responsive 
regulatory element comprises a promoter of a drug metabo- 
lizing enzyme gene. 

57. The method of claim 56, wherein the promoter is a 
promoter of a gene encoding a cytochrome P450 monooxy- 
genase, N-acetyltransferase, thiopurine methyltransferase, or 
dihydropyrimidine dehydrogenase. 

58. The method of claim 56, wherein the drug-responsive 
regulatory element of the second expression cassette com- 
prises a drug signaling pathway-responsive promoter that 
causes expression of a screenable marker in a cell where a 
selected drug signaling pathway is activated. 

59. The method of claim 58, wherein the selected drug 
signaling pathway is a tyrosine kinase pathway, heterotrim- 
eric G protein pathway, small GTPase pathway, serine/threo- 
nine protein kinase pathway, phosphatase pathway, lipid 
kinase pathway, hydrolase pathway, cyclic AMP (cAMP)- 
mediated pathway, cyclic GMP (cGMP)-mediated pathway, 
phosphatidylinositol-triphosphate (PIP3)-mediated pathway, 
diacylglycerol (DAG)-mediated pathway, inositol-triphos- 
phate (IP3)-mediated pathway, EF hand domains of calmodu- 
lin-mediated signaling pathway, pleckstrin homology 
domains of the kinase protein AK T-mediated signaling path- 
way, chromatin regulation signaling pathway, MAPK signal- 
ing pathway, apoptosis/autophagy pathway, translational 
control pathway, cell cycle/checkpoint pathway, DNA dam- 
age pathway, Jak/Stat signaling pathway, NF-KB signaling 
pathway, TGF-B/Smad signaling pathway, lymphocyte sig- 
naling pathway, angiogenesis pathway, vesicle trafficking 
pathway, cytoskeletal signaling pathway, adhesion pathway, 
glucose metabolism pathway, Wnt/Hedgehog/Notch signal- 
ing pathway, stem cell lineage specification pathway, nuclear 
receptor-mediated pathway, or protein folding and stability 
signaling pathway. 

60. The method of claim 49, wherein the selectable marker 
comprises an antibiotic resistance gene or an antigenic 
epitope. 

61. The method of claim 49, wherein the screenable marker 
is further defined as a gene that expresses a fluorescent, lumi- 
nescent, or bioluminescent protein. 

62. The method of claim 49, wherein each of the at least 
two distinct cell types are contained in a separate container 
different from other cell types. 

63. The method of claim 49, wherein step (b) further com- 
prises engineering the iPS cell line with a second exogenous 
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expression cassette comprising a second selectable or screen- 
able marker under the control of a differentiation-responsive 
regulatory element. 

64. The method of claim 63, wherein the differentiation- 
responsive regulatory element comprises a tissue-specific 
promoter. 

65. The method of claim 63, wherein the differentiation- 
responsive regulatory element comprises a cell-specific pro- 
moter that causes expression of a selectable or screenable 
marker when the engineered iPS cell line differentiates to a 
selected cell lineage. 

66. The method of claim 63, wherein the second exogenous 
expression cassette is comprised in a transposon system. 

67. The method of claim 65, wherein the cell-specific pro- 
moter is a neural progenitor-specific promoter, a hepatocyte 
progenitor-specific promoter, a hematopoietic progenitor- 
specific promoter or a cardiac progenitor-specific promoter. 

68. The method of claim 65, wherein the cell-specific pro- 
moter is a promoter specific for a selected terminally differ- 
entiated cell. 

69. The method of claim 68, wherein the cell-specific pro- 
moter is a ventricular cardiomyocyte-specific promoter, an 
atrial cardiomyocyte-specific promoter, a nodal cardiomyo- 
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cyte-specific promoter an arterial endothelial cell-specific 
promoter, a venous endothelial cell-specific promoter, a lym- 
phatic endothelial cell-specific promoter, a blood-brain bar- 
rier endothelial cell-specific promoter, a dopaminergic neu- 
ron-specific promoter, a cholinergic neuron-specific 
promoter, a gabaergic neuron-specific promoter, or a motor 
neuron-specific promoter. 

70. The method of claim 64, wherein the tissue-specific 
promoter comprises a kidney-specific promoter, a kidney 
medulla-specific promoter, a kidney cortex-specific pro- 
moter, a heart-specific promoter, a pan-cardiac promoter, a 
heart atria-specific promoter, a heart ventricle-specific pro- 
moter, a liver-specific promoter, a neural-specific promoter, a 
pancreas-specific promoter, a lung-specific promoter, an 
endothelial-specific promoter, a blood-specific promoter or 
an intestine-specific promoter. 

71. The method of claim 63, wherein the second selectable 
and screenable marker and the first selectable and screenable 
marker can be selected or screened by the same method. 

72. The method of claim 71, wherein each marker is a 
fluorescent protein and each fluorescent protein has a differ- 
ent emission wavelength. 
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